Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4861 |
Symbol | |
ID | 4643839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5201959 |
End bp | 5203089 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639808332 |
Product | glycoside hydrolase family protein |
Protein accession | YP_955640 |
Protein GI | 120405811 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.181594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0880235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGTC GGTTGCGTAA GGCCATGTTG CTGCTGGTTG CGCTGTTGCT CGTCGTCGGC GCAGGGTGGG CCGCCGCGCA GGCGGTGGAC CCACGCATGT CCGACGACGA CGTCCGCGGG ATGCGCGAAC AGGCGGCGCG ACAGGCCGGC GAGGACTTCC TGTCCAGGTA CGTCGAGAGC GACGGTCGCG TCGTCCGACG CGACGAAGGC GGCGATGTCG TCAGCGAAGG GCAGGCCTAC GGCATGCTGA TCGCCGCCGC GCTCGGCGAC GAGCCCCGCT TCCGGGCGAT CTGGGACTGG ACCCGGACCC ACCTGCGCCG CCCCGACGGG TTGTTGTCGT GGCGCTGGGC CGACGGTCGG GTCACGGACC CGAGCAGTGC CACCGACGCG GACCTGGACG CTGCCCGCTC GCTGCTGCTC GCGGCGCGGC GATTCGCCGC GCCGGAACTG GCCGAGGACG GGAAGCGGCT CGGTGCCGAC GTGTTGCGGG GAGAGACCGT GACCGTCGGA GCGGCGCCGT CGCCCGCCAT GGCCCGACCC GGGTTGATCA CCGTCGCGGG TAACTGGGCG ACCGCGCCGC CGCATGCCGT GGACCCCGGG TACTTCAGTC GCCGCGCCGA GCGGGAGCTG CTGGACGCCT CGGCCGACCG GCGGTGGCTC GATGTCAGCA GAACCCAGCG TGTGCTGGTC TGGCAGCTGA TCGGCACGGC TTCACTGCCG CCGGACTGGG CGTCGGTCGA CCCGGCCGGG CGCGCGGTGC CGACGGGTCC ACCCGACGGA GGACCCACCC GGTTCGGGCT GGATGCCGCA CGGCTACCGA TCCGTTTCGC GGAGTCGTGC GACCCGGCGG ACCGTGCGGT GGCGGTGTCA CTGCGCCGGG TGGTCGCCGC GTCGCGCGAC ATCCCGGCGA CCCGCAACCT GGACGGGTCG GCCGCAGGCG AGTGGCAGCA TCCCGTCGCG CTGGTCAGTG CCGCGGCGAC CGATCACGCG GCGGGTGACC GTGAGGCAGG CGCGGCCCGA CTGGATCAGG CCTCCGCGTT GCAGCAGCGC TATCCGACCT ATTTCGGGGC CGCCTGGGTC GCCCTCGGCA GGATCATGCT GGACACGTCG CTGCTCGGCG AGTGCGCATA A
|
Protein sequence | MVSRLRKAML LLVALLLVVG AGWAAAQAVD PRMSDDDVRG MREQAARQAG EDFLSRYVES DGRVVRRDEG GDVVSEGQAY GMLIAAALGD EPRFRAIWDW TRTHLRRPDG LLSWRWADGR VTDPSSATDA DLDAARSLLL AARRFAAPEL AEDGKRLGAD VLRGETVTVG AAPSPAMARP GLITVAGNWA TAPPHAVDPG YFSRRAEREL LDASADRRWL DVSRTQRVLV WQLIGTASLP PDWASVDPAG RAVPTGPPDG GPTRFGLDAA RLPIRFAESC DPADRAVAVS LRRVVAASRD IPATRNLDGS AAGEWQHPVA LVSAAATDHA AGDREAGAAR LDQASALQQR YPTYFGAAWV ALGRIMLDTS LLGECA
|
| |