Gene Mvan_4861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4861 
Symbol 
ID4643839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5201959 
End bp5203089 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID639808332 
Productglycoside hydrolase family protein 
Protein accessionYP_955640 
Protein GI120405811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.181594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0880235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGTC GGTTGCGTAA GGCCATGTTG CTGCTGGTTG CGCTGTTGCT CGTCGTCGGC 
GCAGGGTGGG CCGCCGCGCA GGCGGTGGAC CCACGCATGT CCGACGACGA CGTCCGCGGG
ATGCGCGAAC AGGCGGCGCG ACAGGCCGGC GAGGACTTCC TGTCCAGGTA CGTCGAGAGC
GACGGTCGCG TCGTCCGACG CGACGAAGGC GGCGATGTCG TCAGCGAAGG GCAGGCCTAC
GGCATGCTGA TCGCCGCCGC GCTCGGCGAC GAGCCCCGCT TCCGGGCGAT CTGGGACTGG
ACCCGGACCC ACCTGCGCCG CCCCGACGGG TTGTTGTCGT GGCGCTGGGC CGACGGTCGG
GTCACGGACC CGAGCAGTGC CACCGACGCG GACCTGGACG CTGCCCGCTC GCTGCTGCTC
GCGGCGCGGC GATTCGCCGC GCCGGAACTG GCCGAGGACG GGAAGCGGCT CGGTGCCGAC
GTGTTGCGGG GAGAGACCGT GACCGTCGGA GCGGCGCCGT CGCCCGCCAT GGCCCGACCC
GGGTTGATCA CCGTCGCGGG TAACTGGGCG ACCGCGCCGC CGCATGCCGT GGACCCCGGG
TACTTCAGTC GCCGCGCCGA GCGGGAGCTG CTGGACGCCT CGGCCGACCG GCGGTGGCTC
GATGTCAGCA GAACCCAGCG TGTGCTGGTC TGGCAGCTGA TCGGCACGGC TTCACTGCCG
CCGGACTGGG CGTCGGTCGA CCCGGCCGGG CGCGCGGTGC CGACGGGTCC ACCCGACGGA
GGACCCACCC GGTTCGGGCT GGATGCCGCA CGGCTACCGA TCCGTTTCGC GGAGTCGTGC
GACCCGGCGG ACCGTGCGGT GGCGGTGTCA CTGCGCCGGG TGGTCGCCGC GTCGCGCGAC
ATCCCGGCGA CCCGCAACCT GGACGGGTCG GCCGCAGGCG AGTGGCAGCA TCCCGTCGCG
CTGGTCAGTG CCGCGGCGAC CGATCACGCG GCGGGTGACC GTGAGGCAGG CGCGGCCCGA
CTGGATCAGG CCTCCGCGTT GCAGCAGCGC TATCCGACCT ATTTCGGGGC CGCCTGGGTC
GCCCTCGGCA GGATCATGCT GGACACGTCG CTGCTCGGCG AGTGCGCATA A
 
Protein sequence
MVSRLRKAML LLVALLLVVG AGWAAAQAVD PRMSDDDVRG MREQAARQAG EDFLSRYVES 
DGRVVRRDEG GDVVSEGQAY GMLIAAALGD EPRFRAIWDW TRTHLRRPDG LLSWRWADGR
VTDPSSATDA DLDAARSLLL AARRFAAPEL AEDGKRLGAD VLRGETVTVG AAPSPAMARP
GLITVAGNWA TAPPHAVDPG YFSRRAEREL LDASADRRWL DVSRTQRVLV WQLIGTASLP
PDWASVDPAG RAVPTGPPDG GPTRFGLDAA RLPIRFAESC DPADRAVAVS LRRVVAASRD
IPATRNLDGS AAGEWQHPVA LVSAAATDHA AGDREAGAAR LDQASALQQR YPTYFGAAWV
ALGRIMLDTS LLGECA