Gene Mvan_3877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3877 
Symbol 
ID4649194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4147128 
End bp4149134 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content69% 
IMG OID639807343 
Productglycoside hydrolase 15-related 
Protein accessionYP_954664 
Protein GI120404835 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.929233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.274692 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCTGC CGCAGACCGA GACCTCCGAC GGCGTGTCCC CCAACGGTGA CGGAGGGGCG 
TTCGCGCTGT CCAGCCCGGC GGCGTATCCC AGCTCCGGGC CGCTGCGCAA CCCGTTCCCG
CCGATCGCCG ACTACGCGTT CCTGTCCGAC TGCGAAACGC AGTGCCTGAT CTCGTCGGCC
GGCTCGGTGG AGTGGCTGTG CGTGCCGCGG CCCGACTCGC CCAGCGTGTT CGGCGCGATC
CTGGACCGCG GCGCCGGCCA CTTCCGGCTC GGTCCGTACG GGGTGTCGGT GCCCGCGGCG
CGGCGCTATC TGCCCGGCAG CCTGATCCTG GAGACCACCT GGCAGACCCC CACCGGCTGG
GTGATCGTGC GCGACGCCCT CGTGATGGGA CCGTGGCACG ATCTCGACAC CCGCTCCCGG
ACCCACCGCC GCACGCCGAT GGACTGGGAT GCCGAGCACA TCCTGCTGCG CACCGTCCGG
TGCGTCAGCG GCACGGTGGA ACTGGTGATG AGCTGCGAGC CGTCGTTCGA CTACCACCGC
ACCAGCGCGC ACTGGGAGTA TTCGGCGCAG GCCTACGGCG AGGCCATCGC GCGGGCCACC
AAGAACCCGG ACTCCCATCC GACGCTGCGG CTGACCACCA ATCTGCGGAT CGGTCTGGAG
GGTCGGGAGG CCCGGGCCCG GACCCGTTTG AAGGAAGGGG ACAACGTCTT CGTCGCGCTG
AGCTGGTCCA AGCACCCGGC GCCCCAGAAC TACCAGGAGG CCGCCGACAA GATGTGGACG
ACCAGCGAAT GCTGGCGCCA GTGGATCAAC GTCGGTGACT TCCCCGACCA CCCGTGGCGG
GCCTACCTGC AACGCAGTGC GCTGACGCTG AAGGGTCTGA CCTACTCCCC GACCGGCGCG
CTGCTCGCCG CGCCGACCAC GTCGCTGCCG GAGAGTCCTC AAGGCGAACG CAACTGGGAC
TACCGCTACG CCTGGGTGCG CGACTCCACG TTCGCACTCT GGGGCTTGTA CACGCTGGGC
CTGGACCGCG AGGCCGACGA CTTCTTCGCG TTCATCGCCG ACGTGTCGGG CGCCAACAAC
GGAGACCGGC ACCCGCTGCA GGTGATGTAC GGCGTCGGCG GGGAACGCAG CCTGGTCGAG
GAGGAGCTCA ACCACCTGTC GGGCTACGAC AACGCCCGAC CGGTCCGCAT CGGCAACGGC
GCCTACAACC AGATGCAGCA CGACATCTGG GGCACCCTGC TCGATTCGGT CTACCTGCAC
ACCAAGTCGC GCGAGCAGAT CCCCGAGACA CTGTGGCCGG TGCTCAAGGA ACAGGTCGAG
GAGGCGGTCA AGCACTGGCG CGAGCCCGAC CGCGGCATCT GGGAGGTGCG CGGAGAACCG
CAGCACTTCA CCTCCAGCAA GATCATGTGC TGGGTGGCGC TCGACCGGGG CGCCAAGCTC
GCCGAGTTCG AGGGCGAGAA GTCCTACGCC CAGCAGTGGC GCGCGATCGC CGAGGAGATC
AAGGCCGACA TCCTCGAACA CGGCGTCGAC GAGCGCGGTG TGCTGACCCA GCGCTACGGG
CACGACGCGC TGGACGCGTC ACTGCTGTTG GCGGTGCTGA CCCGGTTCCT GCCACCCGAC
GATCCGCGGA TCCGGGCGAC GGTGCTGGCC ATCGCCGACG AGCTGACCGA AGAGGGCCTG
GTGCTGCGCT ACCGGGTCGA GGAGACCGAC GACGGGCTGT CCGGCGAAGA GGGCACGTTC
ACGATCTGCT CGTTCTGGCT GGTGTCGGCG CTGGTCGAGA TCGGCGAGAT CCATCGGGCC
CGGCATCTGT GTGAGCGGCT GCTGTCGTTC GCCAGCCCGC TGCACCTCTA CGCCGAGGAG
ATCGAACCGC GCACCGGGCG GCACCTGGGC AACTTCCCGC AGGCGTTCAC CCACCTGGCG
CTGATCAACG CCGTGGTGCA CGTGATCCGC GCCGAGGAAG AGGCCGACAG CTCCGGGGTG
TTCCAGCCGG CGAACGCCCC CGTATAA
 
Protein sequence
MVLPQTETSD GVSPNGDGGA FALSSPAAYP SSGPLRNPFP PIADYAFLSD CETQCLISSA 
GSVEWLCVPR PDSPSVFGAI LDRGAGHFRL GPYGVSVPAA RRYLPGSLIL ETTWQTPTGW
VIVRDALVMG PWHDLDTRSR THRRTPMDWD AEHILLRTVR CVSGTVELVM SCEPSFDYHR
TSAHWEYSAQ AYGEAIARAT KNPDSHPTLR LTTNLRIGLE GREARARTRL KEGDNVFVAL
SWSKHPAPQN YQEAADKMWT TSECWRQWIN VGDFPDHPWR AYLQRSALTL KGLTYSPTGA
LLAAPTTSLP ESPQGERNWD YRYAWVRDST FALWGLYTLG LDREADDFFA FIADVSGANN
GDRHPLQVMY GVGGERSLVE EELNHLSGYD NARPVRIGNG AYNQMQHDIW GTLLDSVYLH
TKSREQIPET LWPVLKEQVE EAVKHWREPD RGIWEVRGEP QHFTSSKIMC WVALDRGAKL
AEFEGEKSYA QQWRAIAEEI KADILEHGVD ERGVLTQRYG HDALDASLLL AVLTRFLPPD
DPRIRATVLA IADELTEEGL VLRYRVEETD DGLSGEEGTF TICSFWLVSA LVEIGEIHRA
RHLCERLLSF ASPLHLYAEE IEPRTGRHLG NFPQAFTHLA LINAVVHVIR AEEEADSSGV
FQPANAPV