Gene Mvan_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3094 
Symbol 
ID4646850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3261533 
End bp3262888 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content69% 
IMG OID639806571 
Producthypothetical protein 
Protein accessionYP_953902 
Protein GI120404073 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.292492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.43311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ATTTGTCGTT GCTGGCGTTG TTGGCGTTCA TGCTGCTCAC CGCGGGAACC 
GCGGTGTTCG TCGCCGCCGA GTTCTCCCTG ACCGCGCTCG AACGCAGCAC CGTCGAGGCC
AACGTCCGCT CAGGGGACCG CCGCGATGTG ATGGTGCAGC GTGCCCACCG CACGCTGTCG
ACGCAGTTGT CCGGAGCGCA GGTCGGCATC TCCATCACCA CGCTGGCCAC CGGTTTCCTG
GCCGAACCCG TCGTCGCCCG GCTGATCCAC CCCGGCCTGA CCGCCATCGG GATCCCGGAC
CGGTTCGTCG GCGGTCTGGC GTTGACCCTC GCCATCCTGA TCGCCACCTC GATCTCGATG
GTGTTCGGCG AGCTGGTGCC CAAGAACCTC GCGGTGGCCC GCCCGGTGCC GACCGCGCGG
TGGTCGGCAC CGCTGCAGCT GATGTTCTCG TTCCTGTTCA CGCCCCTGAT CCGGCTGACC
AACGGCACCG CGAACTGGAT CCTGCGGCGG CTCGGCATCG AACCGGCCGA GGAACTGCGC
TCGGCACGCT CACCCCAGGA GCTGGTGTCG CTGGTGCGGT CCTCGGCCGA ACGCGGATCG
CTGGACCCGG TCACGGCGCT GCTGGTGGAC CGCTCCCTGC AGTTCGGCGA CCGCTCCGCC
GAAGAGCTGA TGACGCCGCG GTCCAAGATC GACACGCTGG AGGCCGACGA CACGGTCGCC
GACCTCAGCG ACGCCGCGAC CCGAACGGGC CACTCCCGCT TCCCCGTCAT CCGCGGTGAC
CTCGACGAAA CCGTCGGCAT GGTGCACGTC AAACAGGTGT TCGCCGTGCC GGCCGACGCC
CGCGCGACAA CCAGGCTGGC CACCCTGGTC CAGCCCGTCA CCAAGGTGCC TTCGACGCTC
GACGGGGATG CGGTGATGTC GGAGGTGCGC GCCAACGGTC TGCAGACCGC GTTGGTGGTC
GACGAATACG GCGGCACCGC GGGCATGGTG ACGGTCGAGG ATCTGATCGA GGAGATCGTC
GGCGATGTGC GCGACGAACA CGACGTCGAA CCGCCCGACG TGGTGCAGGC CGGCCGTGGC
TGGCAGGTCT CCGGTCTGCT GCGCATCGAC GAGGTGGCTC AGGGCACCGA GTTCCGGGCA
CCTGAAGGCG ACTACGAAAC CATCGGCGGT CTGGTGCTGG AGAAGCTCGG CCACATACCG
GAGGAAGGCG AGTCGGTGGA GCTGATCGCC TTCGACCCGG ACGGCCCGAT CCAGGATCCG
GTGCACTGGC TGGCGACCGT GGTCAAGATG GACGGCCGCC GCATCGACCA GCTGCGGCTG
ACCGAACTCG GCCGCAAGGG AGACAGCCGT GGGTGA
 
Protein sequence
MSSNLSLLAL LAFMLLTAGT AVFVAAEFSL TALERSTVEA NVRSGDRRDV MVQRAHRTLS 
TQLSGAQVGI SITTLATGFL AEPVVARLIH PGLTAIGIPD RFVGGLALTL AILIATSISM
VFGELVPKNL AVARPVPTAR WSAPLQLMFS FLFTPLIRLT NGTANWILRR LGIEPAEELR
SARSPQELVS LVRSSAERGS LDPVTALLVD RSLQFGDRSA EELMTPRSKI DTLEADDTVA
DLSDAATRTG HSRFPVIRGD LDETVGMVHV KQVFAVPADA RATTRLATLV QPVTKVPSTL
DGDAVMSEVR ANGLQTALVV DEYGGTAGMV TVEDLIEEIV GDVRDEHDVE PPDVVQAGRG
WQVSGLLRID EVAQGTEFRA PEGDYETIGG LVLEKLGHIP EEGESVELIA FDPDGPIQDP
VHWLATVVKM DGRRIDQLRL TELGRKGDSR G