Gene Mvan_5742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5742 
Symbol 
ID4644197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6132204 
End bp6133355 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content69% 
IMG OID639809218 
Productextracellular solute-binding protein 
Protein accessionYP_956513 
Protein GI120406684 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.252794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCT CACCCAACCC CCCACGGTTG ACGTCCCGGG TTTTCGCGGT GGCCGCGTCG 
GCGCTGTTGT TCGGGAGTGC CGTGGCGTGC GCGCCGCCCG AGAAGGACAA CTCCAACGCC
CAGACCGAGT CCGGGGTGAA CGCGGCCGAG GCCACCTCGG CAGGGGATTT CGGCGGCATG
GAGGGGCTCG TCGAGGCGGC CAAGGCCGAG GGTGAGCTCA ATGTGATTGC GCTGCCGCCG
GATTGGGCGA ACTACGGCGC GATCATCAAG GCGTTCTCCG ACAAGTACGG CATCAAGGTC
AACTCCGCGC AGCCCGACGC CTCCAGCCAG GACGAGATCA ACGCCGCCAA CCAGCAGAAG
GGCCGCAGCA GCGCCCCCGA CGTGTTCGAC CTCGGCCAGT CGGTGGCGCT GGCCAACACG
GCGATGTTCG CGCCGTACAA GGTGGAGACG TTCGACGACA TCCCCGCGGC GTTCAAGGAC
GCCGACGGCA CCTGGGTCAA CGATTACGGC GGCTACATGT CGATCGGGTT CGACTCGTCC
AAGGTGCCGC CGGTGACCAG CGTCGACGAC CTGCTCAAGC CGGAGTACCA GGGCAAGGTG
GCCCTCAACG GTGATCCGAC GCAGGCGGGT GCGGCGTTCT CCGGTGTCCT GATGGTGGCG
TTGTCGCAGG GCGGCTCGGC CGACGACATC GCACCCGGCG TCGAGTTCTT CCGCAAACTC
AAGGAGGCGG GCAACTTCCT GCCGGTCGAC CCGACCCCGG CCACCATCGA GTCCGGGCAG
ACGCCCGTGG TGATCGACTG GAACTACACC AACTCCGCCG AGACGAAGAA GCTGCCGTCG
TGGACGGTGC TGGTGCCGCC GGAGAACCCG GTGGCCGGGT ACTACTACCA GGCGATCAAC
AGGGACGCCC CGCATCCCGC CGCCGCGCGG TTGTGGCAGG AGTTCCTCTA CAGCGACGAG
GGCCAGAACC TGTTCGCCCA GGGCGGGGTG CGGCCGGTGC GGGCGGACAA CATGCTCGCC
GACGGCACCC TCGATCCGGC GGTCGCCGCG GCGTTGCCGG TGGTCGACGG CCCGGTGACC
GTGCCCACGC CGCAGCAGAC CGAGGCGGCG TCGAAGTACC TCGCGGAGAA CTGGGCCGCC
GCGGTCGGCT GA
 
Protein sequence
MNTSPNPPRL TSRVFAVAAS ALLFGSAVAC APPEKDNSNA QTESGVNAAE ATSAGDFGGM 
EGLVEAAKAE GELNVIALPP DWANYGAIIK AFSDKYGIKV NSAQPDASSQ DEINAANQQK
GRSSAPDVFD LGQSVALANT AMFAPYKVET FDDIPAAFKD ADGTWVNDYG GYMSIGFDSS
KVPPVTSVDD LLKPEYQGKV ALNGDPTQAG AAFSGVLMVA LSQGGSADDI APGVEFFRKL
KEAGNFLPVD PTPATIESGQ TPVVIDWNYT NSAETKKLPS WTVLVPPENP VAGYYYQAIN
RDAPHPAAAR LWQEFLYSDE GQNLFAQGGV RPVRADNMLA DGTLDPAVAA ALPVVDGPVT
VPTPQQTEAA SKYLAENWAA AVG