Gene Mvan_4301 details

Gene Information

Locus tag: Mvan_4301
Symbol:
ID: 4648403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4609098
End bp: 4611188
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 67%
IMG OID: 639807769
Product: alpha amylase, catalytic region
Protein accession: YP_955084
Protein GI: 120405255
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
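
The start/end coordinates, gene length, and protein length in this record are mutually consistent. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record; the function names are hypothetical and only for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4609098, 4611188)
print(length, protein_length(length))  # prints: 2091 696
```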


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.844806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCCG GTCGTATCGA GATCGATGAC GTCCAGCCCG TGGTGTCCGA AGGGCGGTTT 
CCCGCCAAAG CGGTCATCGG CGAAATCGTC CCGGTGACGG CAACCGTATG GCGGGAAGGC
CACGACGCGG TGTCGGCGAC GCTCGTGGTG CGCTACCACG GGACCGGTTA TCCGCCGCTG
GCCGACGAGC CGCCCGGACG GGTCCGCCCG ACCGAGGCGG TACCCATCCA GGAGGTCGTC
AAGCCCGGCC AGCGGATCCG GCCCGTCGCG CTGCCGATGT CGCTGGGACG GACACCCGAT
GTGTTCCACG GCCAGTTCAG TCCCGACGAG GTCGGCCTGT GGACGTTCCG GGTGGACGGC
TGGGGCGATC CCATCGCCAC CTGGCGCAGG CACGTGATCG CCAAGCTCGA CGCCGGCCAG
AGCGAAGGCG AGCTCGACAA CGACCTGCTC ATCGGCGCCC GACTGCTGGA CCGCGCCGCC
ACCGGAGTGC CGCGGCAGGA CCGTTATCCG CTCGCCGAGG CGGCGGCCCG GTTGCGCGAA
CCGGGCGACC CGTTCTACCG CGCCGGAGCG GCGCTGGCCC CCGAGATCAC CGCGCTGCTC
GACCAGTATC CGCTGCGGGA ACTGATCACC CGCGGCAGGC AGTACGGCGT CTGGGTGGAC
CGGCCGCTGG CCCGGTTCAG TTCCTGGTAC GAGTTCTTCC CGAGGTCGAC GGGTGGCTGG
GACAGCGCGG GCCATCCCGT GCACGGAACG TTCGCCACCG CGACCAAGGC GCTGCCCCGG
GTGGCCAGGA TGAACTTCGA CATCGTCTAC CTGCCGCCGA TCCACCCCAT CGGCAAGGTG
CACCGCAAAG GGCGTAACAA CAGTGTCACC GCCGCACCGG GCGACGTCGG GTCGCCCTGG
GCGATCGGCA GCGACGAGGG TGGTCACGAC GCCGTGCATC CCGACCTCGG CACCATCGAC
GACTTCGACG ATTTCGTCAG CGCCGCCCGC GACGAGGGGC TGGAGGTGGC GCTGGACCTG
GCACTGCAGT GCGCGCCGGA CCATCCGTGG GCGCGTGAGC ATCCCGAGTG GTTCACGGTG
CTGCCGGACG GGACGATCGC GTACGCCGAG AACCCGCCGA AGAAGTACCA GGACATCTAT
CCGCTGAACT TCGACAACGA TCCGACAGGG CTGTACGAGG AGGTCCTACG GGTCGTCAAG
TTCTGGGTGT CGCACGGGGT CAAGGTGTTC CGGGTCGACA ATCCGCACAC CAAACCGCCC
AACTTCTGGG CGTGGCTGAT CGGCGAGGTC AAGAACACCG ACCCCGACGT GCTGTTCCTC
GCGGAGGCGT TCACCCGCCC GGCACGGCTG TTCGGGTTGG CCAAGCTCGG ATACACGCAG
TCCTACACGT ACTTCACCTG GCGCACCGCG AAGTGGGAAC TCACCGAGTT CGGGCAGTCG
ATCGCCGATC ACGCCGACTA CTGCCGGCAG AGCCTGTGGG TGAACACGCC CGACATCCTG
CACGAAAGCC TGCAGCACGG CGGGCCGGGG ATGTTCGCCA TTCGCGCGGT GCTCGCATCG
ACGATGAGCC CGACGTGGGG CATGTACTCG GGGTACGAGC TGTTCGAGCA CCGGGCAGTC
CGCGAGGGCA GCGAGGAATA CCTGCACTCC GAAAAGTACG AGCTGCGGCC GCGGGACTTC
GACGCGGCAC TGTCGGACGG GGAATCGCTC GAACCCTTCA TCACCCGCCT GAACGAGATC
CGTCGCGTGC ACCCGGCGCT GCAGCAGTTG CGCACCATCA CGTTCCACCA CATCGACAAC
GACGCACTAC TGGCCTACTC CAAGTTCGAC CCGGTCTCCG GGGACCAGGT GCTCGTCGTG
GTGACCCTCA ACGCGTTCGG TCCGGAGGAG GGCATCCTCT GGCTGGACAT GGGCGCACTG
GGAATGGAGC AGCAGGACCG CTTCTGGGTA CGTGACGAGA TCTCCGGAGA CGAATACCAG
TGGGGACAAA GCAATTACGT CCGCCTTGAT CCCGCCCGCG CGGTGGCGCA CGTGTTGAAC
ATGCCCCAGG TGCCCGCCGA CCAACGAGCC AACCTTCTGC GTAGGGAGTG A
 
Protein sequence
MTAGRIEIDD VQPVVSEGRF PAKAVIGEIV PVTATVWREG HDAVSATLVV RYHGTGYPPL 
ADEPPGRVRP TEAVPIQEVV KPGQRIRPVA LPMSLGRTPD VFHGQFSPDE VGLWTFRVDG
WGDPIATWRR HVIAKLDAGQ SEGELDNDLL IGARLLDRAA TGVPRQDRYP LAEAAARLRE
PGDPFYRAGA ALAPEITALL DQYPLRELIT RGRQYGVWVD RPLARFSSWY EFFPRSTGGW
DSAGHPVHGT FATATKALPR VARMNFDIVY LPPIHPIGKV HRKGRNNSVT AAPGDVGSPW
AIGSDEGGHD AVHPDLGTID DFDDFVSAAR DEGLEVALDL ALQCAPDHPW AREHPEWFTV
LPDGTIAYAE NPPKKYQDIY PLNFDNDPTG LYEEVLRVVK FWVSHGVKVF RVDNPHTKPP
NFWAWLIGEV KNTDPDVLFL AEAFTRPARL FGLAKLGYTQ SYTYFTWRTA KWELTEFGQS
IADHADYCRQ SLWVNTPDIL HESLQHGGPG MFAIRAVLAS TMSPTWGMYS GYELFEHRAV
REGSEEYLHS EKYELRPRDF DAALSDGESL EPFITRLNEI RRVHPALQQL RTITFHHIDN
DALLAYSKFD PVSGDQVLVV VTLNAFGPEE GILWLDMGAL GMEQQDRFWV RDEISGDEYQ
WGQSNYVRLD PARAVAHVLN MPQVPADQRA NLLRRE
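
The gene sequence begins with GTG while the protein begins with M, which is what one would expect under translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It is a self-contained illustration, not the tool that produced this record: the `translate` function and the constant names are hypothetical, the sense-codon assignments used are those of the standard code (which table 11 shares), and the set of alternative start codons is taken from the NCBI definition of table 11 as best understood here.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for translation table 11 (bacterial/archaeal).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace; stops at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence above.
print(translate("GTGACCGCCG GTCGTATCGA GATCGATGAC"))  # prints: MTAGRIEIDD
```

The output matches the first ten residues of the protein sequence listed above.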