Gene Mvan_5243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5243 
Symbol 
ID4645258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5614797 
End bp5616878 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content67% 
IMG OID639808718 
Producthypothetical protein 
Protein accessionYP_956020 
Protein GI120406191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.751292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.524557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGC ACCGCGACGC ACCTCCACAC TCCGGGGAAG CATCGGCTGC CGGGCCCACT 
CAGAGCAACT TTGAATTTGC TGACGCGAAA TTTGCCAGAA CTTCAATTAC AGGTCCGTCC
CCTTCGTCAA GGGCCACAGA TCTCAAGCGA ATCGGGCGCA TCGGTGCGCT GGCAGTGGCG
CTGGGAGTGG GCGCCGCGAT CGCCGCCGTC CCGGGAGTCG CCTTCGCGGA CACCGCGGAG
GGAGGTTCGC CAGAATCCTC GGCAGAGCGA AGCCAGGAAT CGAGCTCCGG CACCGAGGCC
GCCCCGCCGC AGCACCGGTC GGCCCGGGAG AAGCCGGCCG ATCGAGATGA CAAGCCGTCA
CCGCCCACCC GCCGCGGTGC CACGAGGACC GCCGCGACCG AAGCGGCGTC CGGGCGCGAT
CGCCCCACAC GTGACACCGA GTCTCCCGCG TCCGGCGTCG GCGAGAGCGC CGTAACGTCC
GATGGCGACG ACCGACAACC GGCCGAGCAG TCCGTTGCGA CCACCACGAC GACGCTGAGG
TCCGCAACAT CCTCGTCGAG CCAACCTGCC GGCGTCCCGG ACAGCCCGGC GCAGGAATCC
GTCCTGCTGG CGGCCTCCGC GTCGATCGGG CGCCCGAACG AGGCCCGGAC CGCGATCCCT
TCCCCCGCCG CGAGAAGCGC GCAGACATCA GTGCTCACCG AGGCGCCCCA GCAGGCCAGC
TCGACAGCTG CCGACGACGC AGACGCCCAG GACCGGGACG TCGTCATCGA TGCCGACTCC
CTGACCGTGT CGCGGCCGTC GAACGGCACC ACCTACACCG ATCCCACCGC TTCGGGTGGC
ACCGCGTTAC TACTTCATCG AAACGCGACC GCCTCGACGA CCGTGACCCT TCCGGAGTTC
ACCAGCCTCG TGATCCGCGC AAAAGGCGAC CAGTACCGTG GTGCGCCCGA GATGAGGGTG
TCCGTCAACG GCAAAGTCGT GTCCAAGGTG GCTGTCACGG CCACGTCATG GACCGACTAC
ACCGTTCCCT TCAGCGGTCC GGCCGGCACC TACACCCTGA GCGTCGCCTT CACCAACGAT
CGGTACTCAC GGCGCAACGG CGACCGAAAC CTGCGGCTCG ATACCGTCAC CGTGGTCTCA
GCCGTCGTCG ACCCGGAGCC GCCCACCCCG CCGAGCACCG GTTCGCCTGG GTATTTCGAA
GGCGCCGACT GGTTGTGGAA GCCGATTGCC ACCAACCCGG TGTTGGCCAC CAACAGCGCG
ACGTGGGTGA ACTACCTGGC AGCGCCGGAC AAGCTGCGGA TCGCCAACCT GTACGACTAC
TCGGTGGCGC TGGTGTCGGC CTCGGAAATC ACTTCGAGCA CACCGCGATA CGACGTCGCC
TTCACCGAAC CGTGGGGCAG TGACCCGTTC GGCAACACCA CGGTGCCGAT TCCACTCGGA
ACCAAGGTGC CCCCGGGGAG CGATGGCCAC GTCGCGATCC TCGATCCGAC GACAGGGCTG
GCGTACGGCA TCTGGCAGGC GAAGTACAAC AGCTCGACCA ACACGTGGTC CGGGTCGTGG
GGCGGAATGA CGGACCTCGA CGGTGACGGA ATCGACCAGT CGGGATCGGC GACGGCCGCC
GCCATCGCCC GGTATGCCGG CGTGGTCACG GCAGCGGAAC TCAGTGCGGC CATCGCCGCG
AACACCGGGA TCAACCACGC GCTCGCCTTC TCCACCGACC TCGCGGGACC CGACTTCGTC
TACCCCGCAG CGAAATCGGA CGGCCAGAAC TGGGCCGGTG TCGCGGTGCC GATACCCGAG
GGTTACCGGA TTCAACTCGA CCCGAGCATC GACGTCGACG CCATCCCGGG GATGACGCCG
GGCGAGAGGG TGATCGCCAA GACTCTGCAG ACCCACGGCG CCTACGTCGT CGACCAAGGC
TCAGCCCGGA TGGCCTTCGC CTTTGAACTG CTTGACGACG CAACCTCAAC CTCACCGGGT
TCGGTCTGGG TCGACGCCGG ATTCGCCTGG GATTACTACG ACATGAACGC CATCCCGTGG
TCCCAGTTGA GGGTGCTCGC ACCCACCTCC GTCTCGGTGT AG
 
Protein sequence
MSSHRDAPPH SGEASAAGPT QSNFEFADAK FARTSITGPS PSSRATDLKR IGRIGALAVA 
LGVGAAIAAV PGVAFADTAE GGSPESSAER SQESSSGTEA APPQHRSARE KPADRDDKPS
PPTRRGATRT AATEAASGRD RPTRDTESPA SGVGESAVTS DGDDRQPAEQ SVATTTTTLR
SATSSSSQPA GVPDSPAQES VLLAASASIG RPNEARTAIP SPAARSAQTS VLTEAPQQAS
STAADDADAQ DRDVVIDADS LTVSRPSNGT TYTDPTASGG TALLLHRNAT ASTTVTLPEF
TSLVIRAKGD QYRGAPEMRV SVNGKVVSKV AVTATSWTDY TVPFSGPAGT YTLSVAFTND
RYSRRNGDRN LRLDTVTVVS AVVDPEPPTP PSTGSPGYFE GADWLWKPIA TNPVLATNSA
TWVNYLAAPD KLRIANLYDY SVALVSASEI TSSTPRYDVA FTEPWGSDPF GNTTVPIPLG
TKVPPGSDGH VAILDPTTGL AYGIWQAKYN SSTNTWSGSW GGMTDLDGDG IDQSGSATAA
AIARYAGVVT AAELSAAIAA NTGINHALAF STDLAGPDFV YPAAKSDGQN WAGVAVPIPE
GYRIQLDPSI DVDAIPGMTP GERVIAKTLQ THGAYVVDQG SARMAFAFEL LDDATSTSPG
SVWVDAGFAW DYYDMNAIPW SQLRVLAPTS VSV