Gene Anae109_3071 details

Gene Information

Locus tag: Anae109_3071
Symbol:
ID: 5374573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3587982
End bp: 3590006
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 72%
IMG OID: 640844595
Product: alpha amylase catalytic region
Protein accession: YP_001380251
Protein GI: 153005926
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
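
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3587982
END_BP = 3590006
GENE_LENGTH_BP = 2025
PROTEIN_LENGTH_AA = 674


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the three fields agree: the span covers 2025 bp, which is 675 codons, or 674 amino acids plus the stop codon.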


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.224313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.685148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAGC GCGCACGCTC GCGGCCGGCC TCTCCGCCGG CCACCGAGCG CCCGCCCCGG 
GTCGTCATCG AGAACGTCAC GCCCGAGATC GACGGCGGGC GCTTTCCGAT CAAGCGCACC
GTGGAAGAGC AGGTCGTCGT CGAGGCGGAC GTGTTCACCG ACGGGCACGA GTCGCTCGCG
GCCGTCCTCC GTTACCGGCG CGAGAGCGAT CCGGCCTGGA CGGAGCTCCC GATGGTGCCC
CTCGGCAACG ATCGCTATCG CGCGTCGTTC GGCGTCCACG CCCTCGAGCC CTACCGCTAC
ACGGTCGAGG CGTGGCTGGA TCCGTTCGCC ACCTGGGAGC ACGGGCTCGC GAAGAAGGCG
ACGGCAGCCG CAGTCGAGCC GGTGGATCTG CTCGTCGGGG CGGAGCTCGT GGAGCAGGCG
GCGTCGCGCG CGCCGGACGA GGCCGCGGCG CGGCTGCGGA GGTGGGCCGC GGCGCTGCGC
GGGGACGAGC CGCTGGAGGC GCGGGTGCGG CGAGCGCTCG ACCCGGAGCT GCGGCAGCTC
GTGGCACGCC ACCCGGATCG CGCCCACGCC GCGCGTCATC CGCGCGAGCT GGCGGTGTCG
GTCGATCGCG AGCGGGCGCG CTTCAGCAGC TGGTACGAGC TCTTCCCCCG CTCGGCGTCC
CAGGAGCCCG GGCGGCACGG CACCTTCCAG GACGTCCTCG ACCGGCTGCC GGACGTCGCG
TCGATGGGCT TCGACGTCCT CTACCTGCCC CCCATCCATC CCATCGGGCA CGCCTACCGC
AAGGGTCCGA ACAACGCGCC GCAGGCCTCC CCCGGAGACG TCGGCAGCCC CTGGGCGATC
GGCGGACCCG AGGGCGGGCA CACCGCCCTC CACCCGCGCC TCGGGACGAT CGAGGACTTC
CGGCGGCTCG TCGCGCGCGC GGGGGAGCTC GGCCTCGAGG TGGCGCTGGA CCTCGCCTTC
CAGTGCTCGC CGGATCATCC CTGGGTGAAG GAGCATCCGC AGTGGTTCCA CCGTCGGCCG
GACGGCAGCA TCCAGTACGC CGAGAACCCG CCGAAGAAGT ACCAGGACGT CTACCCCATC
GAGTTCGACA ACCCCGACTG GCGGGGGCTC TGGGCGGCGC TCCTCGGCGT CGTGGAGCAC
TGGGCGGAGG AGGGGGTGCG CATCTTCCGC GTGGACAACC CGCACACGAA GCCGTTCCGG
TTCTGGGAGT GGCTCATCGC TGAGGCCCGC CGCGCGCGGC CCGAGCTCGT CTTCCTCGCA
GAGGCCTTCA CCCGGCCCAA GGTGATGGCG CACCTCGCGA AGGCCGGCTT CGCGCAGTCG
TACACGTACT TCACCTGGCG CAACACCAAG GGGGAGCTCA CGCAGTACCT CACCGAGCTC
ACGCGCACCC AGGCCAAGGA GTACTTCCGG CCGAACTTCT GGCCCAACAC GCCGGACATC
CTCCCCGAGC CGCTGCAGTA CGGGGGGCGC CCGGCCTTCC AGGCGCGGCT GGTCCTCGCG
GCCACGCTGG CCTCGAGCTA CGGCATCTAC GGCCCCGCCT TCGAGCACCT CGAGGCCCGC
GCGGTGCGCC CCGGCAGCGA GGAGTACCTC GACTCGGAGA AGTACCAGCT CCGCCACTGG
GACCTGGAGC GGGCGGACAG CCTGCGCGAC TTCGTCGCCC GGGTGAACCG CATCCGGCAC
GAGAACCCGG CCCTGCAGTC GAACGAGGGG CTGACGTTCC ACCGGATCGA CAACGAGCAG
CTCCTCGCGT ACTCGAAGGC GACCGAGGAC CTCGCCGACC TCGTGCTCGT GGTGGTGAAC
CTGGATCCGC ACCACGTGCA CGCGGGCTGG CTCGAGCTCC CGCTCGAGCT CCTCGGGCTC
TCGCGCGACG AGCCGTACCA GGTCCACGAC CTGCTCGGCG ACGGGCGCTA CCTCTGGCAC
GGCGCGCGGA ACTACGTGGA GCTCGATCCG CGGAGCGCCC CCGCCCAGAT CTTCCGCATC
CGCCGGCGCG TCCGGACGGA GCGGGACTTC GACTACTTCA TGTGA
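
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed; the helper names `join_blocks` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a contiguous nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence shown above.
fragment = join_blocks("ATGGAAGAGC GCGCACGCTC")
print(len(fragment), gc_percent(fragment))
```

Passing the complete gene sequence through the same two functions should give a value comparable to the GC content field of the record, assuming that field is computed over this sequence.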
 
Protein sequence
MEERARSRPA SPPATERPPR VVIENVTPEI DGGRFPIKRT VEEQVVVEAD VFTDGHESLA 
AVLRYRRESD PAWTELPMVP LGNDRYRASF GVHALEPYRY TVEAWLDPFA TWEHGLAKKA
TAAAVEPVDL LVGAELVEQA ASRAPDEAAA RLRRWAAALR GDEPLEARVR RALDPELRQL
VARHPDRAHA ARHPRELAVS VDRERARFSS WYELFPRSAS QEPGRHGTFQ DVLDRLPDVA
SMGFDVLYLP PIHPIGHAYR KGPNNAPQAS PGDVGSPWAI GGPEGGHTAL HPRLGTIEDF
RRLVARAGEL GLEVALDLAF QCSPDHPWVK EHPQWFHRRP DGSIQYAENP PKKYQDVYPI
EFDNPDWRGL WAALLGVVEH WAEEGVRIFR VDNPHTKPFR FWEWLIAEAR RARPELVFLA
EAFTRPKVMA HLAKAGFAQS YTYFTWRNTK GELTQYLTEL TRTQAKEYFR PNFWPNTPDI
LPEPLQYGGR PAFQARLVLA ATLASSYGIY GPAFEHLEAR AVRPGSEEYL DSEKYQLRHW
DLERADSLRD FVARVNRIRH ENPALQSNEG LTFHRIDNEQ LLAYSKATED LADLVLVVVN
LDPHHVHAGW LELPLELLGL SRDEPYQVHD LLGDGRYLWH GARNYVELDP RSAPAQIFRI
RRRVRTERDF DYFM
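
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the use of `X` for unrecognized codons are choices made for this illustration, not part of the source record.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Start and end of the gene sequence shown above.
print(translate("ATGGAAGAGC GCGCACGCTC G"))
print(translate("GACTACTTCA TGTGA"))
```

The two fragments correspond to the beginning (`MEERARS`) and the end (`DYFM`, followed by the stop codon) of the protein sequence listed above.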