Gene Mvan_5229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5229 
Symbol 
ID4645244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5598893 
End bp5600056 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content67% 
IMG OID639808704 
Producthypothetical protein 
Protein accessionYP_956006 
Protein GI120406177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.340919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.127707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTCCGG ATAGAACGCG TTCTAGTTTT GGGGCTGTGT TCACGCAGCC GCTGGCCGAG 
GCCATCGCCG AGGCCGAGAA ACTCGTCGCC GCCGCCCCGT TCATCGAATC CGAGGCGGAC
CTGCTCGAGG GGCTGCAGTA TCTGGCCGGC TGCGTCGCGG CGTGCACGCA CGTCGCGTTC
GACTACGACC GCGACCACCC CTTCCTGCAC AGCGGCACCG GCCCGTTTAC CAAGATGGGT
CTCGACAACC CCGACACCAT GTACTTCGGC ACCCGTGTGC AGCCCGGCCA CGAGTACGTG
GTCACCGGCA GGCGCGGCAC CACCACCGAC GTCAGCTTCC AGCTTCTCGG CGGCGAATAC
ACCGACGAGG TGGTCCCGGA CAGCGAGACG GCGTTCGACG ACCGCAAGCT CGACATCGCC
GCCGACGGCA CCTTCGAATG GCGGTTCACC CCGAAGGTGC CGTCCCAGCT CGTCATCCGC
GAGGTCTACA ACGACTGGTC CGCCCAGCGC GGCACCTTCG CGATCGCGCG CACCGACACC
GCGGGCACCG CACCGCCGCC GCTGACGCGC GAGCTCATCG AGAAGCGCTA CGCCGTCGCC
GGGAAGCAGC TGGTGCAGCG CGTCAAGACG TGGTTGCAGT TTCCTCAGTG GTTCTACAAC
GACACCCAGC CGAACTCGAT GGTGGCGCCC CGGCTCACCC CCGGCGGGCT GGCCACCCAG
TACTCGTCGG CGGGACAGTT CGATCTCGCC GAGGATCAGG CGCTGATCAT CACGCTTCCG
GTCACCGACG CGCCCTACCT CGGGTTCCAG CTGGGCAGCC TCTGGTACAT CTCGCTGGAC
TACATCAACC ACCAGACGTC GTTGAACGGC ACTCAGGCGC AGGCGGACCC GGACGGCATG
ATCCGTATCG TCGTCGCCGA CCGCAATCCC GGCGTGACGA ACTGGGTGGA GACACTCGGG
CACCGCAAGG GCTTCCTGCA GTTCCGCTGG CAGCGGGTGT CGCGTGAGCT GACGCCCGCC
GACGGGCCGA CCGTGGAGCT GGTCGACATC GACAAGGTCG CCGCGGCACT GCCGTACTAC
GAATCCAACA CGATCTCGGA ACAGGACTGG CGGGCGCGGA TAGCGCTGCG CCAGAAGCAG
ATCGGCGAAA GAATGGTGGG TTGA
 
Protein sequence
MTPDRTRSSF GAVFTQPLAE AIAEAEKLVA AAPFIESEAD LLEGLQYLAG CVAACTHVAF 
DYDRDHPFLH SGTGPFTKMG LDNPDTMYFG TRVQPGHEYV VTGRRGTTTD VSFQLLGGEY
TDEVVPDSET AFDDRKLDIA ADGTFEWRFT PKVPSQLVIR EVYNDWSAQR GTFAIARTDT
AGTAPPPLTR ELIEKRYAVA GKQLVQRVKT WLQFPQWFYN DTQPNSMVAP RLTPGGLATQ
YSSAGQFDLA EDQALIITLP VTDAPYLGFQ LGSLWYISLD YINHQTSLNG TQAQADPDGM
IRIVVADRNP GVTNWVETLG HRKGFLQFRW QRVSRELTPA DGPTVELVDI DKVAAALPYY
ESNTISEQDW RARIALRQKQ IGERMVG