Gene Mvan_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3966 
Symbol 
ID4646958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4240212 
End bp4241222 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content69% 
IMG OID639807428 
Productregulatory protein, LacI 
Protein accessionYP_954749 
Protein GI120404920 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCACC GGTACAAGGT CCGTGAGATC GCCCAGCAGT CCGGATTGAG TGAGGCGACC 
GTGGACCGCG TCCTGCACAA CCGCCCCGGT GTCCGCGAGA ACACCGTCGC CGAGGTGAAC
CAGGCGATCG CCGATCTCGA CAAGCAACGG GCACAGCTGC GACTCAACGG GCGGCGCTAC
CTGATCGACG TGGTCATGCA GACACCGCGG CGATTCTCCG ACGCGTTCCG CGCCGCCGTC
GAGGCCGAAC TGCCCGCTTT CGCCCCCGCG ATGCTGCGCG CCCGCTTCCA CCTGTGGGAA
TCGGGGTCGA CGGCGCAGAT GGTGGAGGCC CTCGGCAGGA TCCGCGGCAG TCACGGCGTC
GTGCTCAAGG CACAGGACGA TCCCGCGGTC GCCGAGGCCG TCGACCGACT GGTCGACTCG
GGGGTACCGG TGGTCACCTA CACGACCGAC GTGCCGTCGA GCGCTCGCTG TGGCTACGTG
GGTATCGACA ACCACGGGGC CGGTGTGACG GCGGCCTACC TCGTGCAGCA GTGGCTGGGC
GAGGCGCCGG CGGATGTGCT GATCACATTG AGCCGCACGG TGTTCCGTGG TGAAGGTGAG
CGTGAGGTCG GATTCCGGTC GGCACTGCGC AACTGCGGGC GCACGATTGT GGAGGTCAGC
GACAGCGACG GTATCGACGC CACCAACGAA CGGCTGGTGC TCGACGCGCT GGCCGCCAAC
CCCGGTGTCC AGGCGGTGTA CTCGGTGGGC GGCGGGAACG TCGCGACCGT CGCGGCGTTC
GAGAAGATCG GCCGCGACTG CAAGGTGTTC ATCGCCCACG ACCTGGACGC CGACAACCGG
CGGTTGCTGA GGGACGGCCG GATCTCGGCC GTGCTGCACA ACGATCTGCG CGCCGACGCC
CGGCTGGCGC TGCGGCTCAT CCTCCAGGAG CGCGGAGCTT TGCCGGTGGA ACCGGTGCGG
CCGGTGCCGA TCCAGGTCGT GACGCCGTAC AACCTGCCCG TGCACCCATA G
 
Protein sequence
MAHRYKVREI AQQSGLSEAT VDRVLHNRPG VRENTVAEVN QAIADLDKQR AQLRLNGRRY 
LIDVVMQTPR RFSDAFRAAV EAELPAFAPA MLRARFHLWE SGSTAQMVEA LGRIRGSHGV
VLKAQDDPAV AEAVDRLVDS GVPVVTYTTD VPSSARCGYV GIDNHGAGVT AAYLVQQWLG
EAPADVLITL SRTVFRGEGE REVGFRSALR NCGRTIVEVS DSDGIDATNE RLVLDALAAN
PGVQAVYSVG GGNVATVAAF EKIGRDCKVF IAHDLDADNR RLLRDGRISA VLHNDLRADA
RLALRLILQE RGALPVEPVR PVPIQVVTPY NLPVHP