Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_0704 |
Symbol | |
ID | 4643647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 746801 |
End bp | 747799 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639804204 |
Product | extracellular solute-binding protein |
Protein accession | YP_951548 |
Protein GI | 120401719 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.721157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0331446 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTT GCGCGAAGAA GAAACAGCCG CTGATGAAGG TGTCTGCATG GGGCGCGCTG CTGGCGGGGG TGCTGGTGTT GGGGGGGTGC GCGCAGACGT CGCCGGTGGT GCCGACACCG AGTGTCACGC TGGCGCCGCC GACGCCTGCG GGGCTGGAGG AGATGCCGCC GGAGCCTGCG CGTGCACCGA CCGCCGCGGA CGACGACTGT GACCCCCTGG CCAGCCTGCG CCCGTTCGAC AACAAGGAAG ACGCCGACAA GGCGGTGGCC AACATCAAGG CCAGGGGCAG GCTCATCGTC GGCCTCGACA TCGGCAGCAA CCTGTTCAGC TTCCGCGACC CGATCACCGG CGAGATCACC GGCTTCGACG TCGACATCGC CGGTGAGATC GCGCGCGACA TCTTCGGCAC CCCGTCGCAG GTGGAATACC GCATCCTGTC TTCGGCGGAT CGCGTCGAGG CGCTGCAGAA GAACCAGGTC GACGTGGTCG TCAAGACGAT GACGATCACC TGTGAGCGCA AGAAACTGGT GAACTTCTCG ACTGCGTACC TGTCCGCCAA CCAGCGCATC CTGGCACCGC GGGATTCGAA CATCCGGCAG TCGTCCGACC TGTCGGGCAA GCGGGTCTGT GTCGCCAAGG GCACCACGTC GCTGGAACGC ATCCAGCAGA TCACGCCGCC GCCGATCATC GTCGGCGTGG TCACCTGGGC GGACTGCCTG GTCGCGTTGC AGCAGCGGCA GGTCGACGCT GTCAGCACCG ACGACTCGAT CCTGGCCGGG CTGGTGTCCC AGGACCCCTA TCTGCACATC GTGGGACCGT CGATGAACGA GGAGCCTTAC GGCATCGGTG TCAACCTGGA AAACACCGGG CTGGTGCGCT TCGTCAACGG GACGCTGCAG CGCATCCGGC GCGACGGCAC CTGGAACACG CTGTACCGCA AGTGGTTGAC CGTACTCGGG CCAGCGCCCG CGCCCCCCGC CGCGAGGTAC TCGGACTGA
|
Protein sequence | MSACAKKKQP LMKVSAWGAL LAGVLVLGGC AQTSPVVPTP SVTLAPPTPA GLEEMPPEPA RAPTAADDDC DPLASLRPFD NKEDADKAVA NIKARGRLIV GLDIGSNLFS FRDPITGEIT GFDVDIAGEI ARDIFGTPSQ VEYRILSSAD RVEALQKNQV DVVVKTMTIT CERKKLVNFS TAYLSANQRI LAPRDSNIRQ SSDLSGKRVC VAKGTTSLER IQQITPPPII VGVVTWADCL VALQQRQVDA VSTDDSILAG LVSQDPYLHI VGPSMNEEPY GIGVNLENTG LVRFVNGTLQ RIRRDGTWNT LYRKWLTVLG PAPAPPAARY SD
|
| |