Gene Anae109_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1778 
Symbol 
ID5375969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2006930 
End bp2010319 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content74% 
IMG OID640843286 
Productglycosyl transferase family protein 
Protein accessionYP_001378965 
Protein GI153004640 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.073121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTCT CCACCTGGCT CCTCCTCCCC ATCTCGGTGC TCCTCGCCTG CGGCCACGGC 
GAGCGAAGCG CCGCGCGCGA CGCGCGGGCG CGCGCGCACG ACGAGCTGTC GGCGCTCTGG
AGCTTCTACC GCTACTCCCA CGTGGACCGC GGGCGCGTCG TCGCGCACGA CGAGGCGGGC
ATCACCACCT CCGAGGGCCA GTCGTACGCG CTGCTCCGCG CGGTGTGGGC CGGCGACCGC
GCGACGTTCG ACGAGGTCTG GCGCTGGACG CGCGAGAACC TGCAGGTCCG CGACGACGAG
CTGCTCGCGT GGAAGTGGAA GGACCGCGTC CTCGACCGCA ACGCGGCGAC GGACGCGGAC
CAGGACACCG CCCTCGCGCT CGTGCTCGCC TCACGGCGCT TCGGCGAGGC GCGCTACCTC
GAGGAGGCGC GGCGCATCCT CGCGGACGTC TGGGAGCGCG AGGTGATCCA GGCCGGCGGC
CGCTTCCTGC CGACGGGCGG GAACTGGGCG CCTCGCGAGC GCTACCCGAC GATCCACGTC
GCCTACCTCG CGCCGTACGC GTACGAGGTC TTCGCCGGGG TGGACCCGGC GCACCCGTGG
AAGGAGCTGG TGCGCGGGAG CTACGAGATC CTCCGCTTCC TGTACTTCGA CCGGGGCGTG
GTGCTGCCGC CGGAGCGGAT CTGGGTGGAC TCGCGCTCGG GGGCGCTCCT GCTCGACAAG
CCCGGCGGCG GCCCCGCGCA GACGTTCGGC TACGACTCCG TCCCGCTCTT CTGGCGAGTC
GCGCTGGACG CGGCCTGGTT CGGGCGGCGG GGAGAGGCGG ACCTGCGCGC GCGCATGCTC
GCGTTCCCAC GCGAGACCTT CCGCGCCGAG GGGAGGATCC GCGACCGCTA CACCACCGGC
GGCCGCGCGC TGTCGGAGCT CGACGGCCAG CCCCACATGG CGTCGATCGC GGCGCTCGCG
GCGGTGGAGG ACCGCGGGCT CGCCTCGGAC CTGCGATCGG GGAGGCTCGA CGCGCTCTTC
GAGCGCGCCC TCGCCGGCGA GGCGACCCCG TACTACCTGC ACAACTGGCT CTGGCTGGGC
CGCGCGCTCG AGCTCGGCGT GGCGCGGCGA TACGACGAGC CGCTCGCGTT CCTGCTCGGG
ATCGACTGGG GGCCGTTCCG GGAGCGCTTC CCCCTCGTCC CCGTGCTCGC CGCCCTCGCG
CTCGCCCCGC TCGCCCGCCG CTTCACGTGG GCCCGCGCCG GGCTGCTCGG GCTCGCGATC
GCGCTCTCCG CGCGCTACCT CGGGTGGCGC GCCCGCGAGA CCCTCAACTT CGTCGAGCCG
CTCGGGCCGG TGATCAGCCT CTCGCTCCTG GCGGCGGAGG CGTACGCGTT CTCGACCGTC
CTGCTCCTCG CCGTCCAGGT CGGCGTCTCG CGCCGGCGCC GGGAGCCGCC CGCGCCGCTC
GCGCCGGACG AGGAGGTCCC CTCCGTCGAC GTCTTCGTCC CCATCTACTC CGAGTCGCTG
GAGATCCTGG ACAAGACGCT CGCCGCCTGC ATGGCGATGC GCCATCCCCG CAAGACCGTC
CACGTGTGCG ACGACTCGCA CCGCGAGGCG GTCGCGCGGC TCGCCGCGGA GCACGGCGCG
CGGTACGTCC CCGGCCCGAA GCGCCACGCC AAGGCGGGGA ACCTCAACAA CGCGCTGTCG
CTGACGAGCG GCGACCTCGT CGTCGTGTTC GACACCGATC ACGTGCCGTG CGCGGCGTTC
CTGGAGCGGA CGGTCCCGCA CTTCCGCGAG CCGCGCATGG GCGTCGTCCA GACGCCCCAC
CACTTCTACA ACCCCGACGT CTTCCAGCGG GCCCTGGGAA CCGGTCCGGC CGTGCCGAAC
GAGGCCGACC TCTTCAACCA CGCCATCCAG GCAGGCCGCG ATCGCTGGGG CGGCGCCTTC
TTCGTGGGCT CGGGCGCGGT GTTCCGGCGC GCCGCCATCG CCTCCGTGGG CGGGTTCAAC
CTGCTCTCCA TCACCGAGGA CATCCACACG AGCCAGAAGC TCCACGCGAA GGGCTGGCGC
TCGGCGTTCG TGGACGAGGA TCTGGCGGTC GGCCTCTCCG CCGAGAACGT GCAGAGCTAC
CTCGTCCAGC GGCGGCGCTG GATGCTCGGG TGCCTGCAGA TCTTCTTCAA GGACAACCCG
CTCCTCGAGC GCGGGCTCCC GCTCCGGCAC CGCGTCGGCT ACTTCGCCTC GCTCTGGTAC
TTCTTCTTCC CGCTCGCGCG CGTGGCGTTC TTCCTCACGC CGCTCTGGTA CCTGCTGTTC
CACCTCCACC CGCTGTTCGC GGACCTGCCG GTCCTGCTCG CCTACCTGGC GCCGCACCTC
CTGCTCTCGA CGCTCGCCGC GAACGCGCTG CTGCCCGGGT GGCCGCGCCT GCTGTGGGGC
ACCGTCTACG AGGCCGCCGT GGCCTTCCCG CTGGCGCGCG CGACGCTCGA CCTGCTCCTG
CCGAAGAGCC TGGGCTTCAA GGTGACGCCC AAGGGGATCG TCTCGGATCG CCGGCGCTTC
GACGCCTCGT CGTCGCGGCT CACGCTCGCC GCGGCGGCGC TCGCGGGGGT GGCGCTCGCG
AAGGGCGCCT TCGAGCTCGT CGCGTTCGGG ATCGAGGTCG AGGCCTACGC GTTCAACCTG
TTCTGGGCGG GCTCGAACCT GGCGGCGGTG CTCGCCGCGT TGCTCGTGGC GTGGGAGCGC
CCGCAGCGCC GCGAGGACGA GCGCGTGCGC CGGCGCGTGC CGGTCCGCGT CGAGGCGCCG
GCGCTGTCGC TCTGCGCGGA GACCGTGGAC CTCTCCACGA CGGGCGCCGG CCTGCGCCTC
CCCGCGAGCG CCGCGCTGCC CCCCGCCGTG GAGGTCGCCT TCTCGGCCCC CGAACCGCTC
CGCGTCCGCG CGCGCGTCGT CTGGCGCGAG GAGGTGCGGG GCGCGACCCG CGCCGGGCTG
GCGTTCGAGG GGCTCCCCGC CGAGGCTCGC CGCGCGCTCG TGCGGATCGC CTTCTCCTCC
GAGGAGGCCT TCTCCGGGGC GCACGATGCG CGCACGCGGA GCCAGCTCGC GATGGCGCTC
CAGCTGCTGG CCGGGATCGC GCGCGCGTTC CTCCCGCTCC ACGCCCGGCG CCGGCTCGCG
CCGAGACGCC CCTCGCTGCG CCCGCTCGCC TGGGTCGGTG CCCGGGGACG CGTCCGCGGG
CTCGTGGTCG ACTCCTCCTC GGGCGGGTTC GGCGCGCTGT TCCTGGGTCG CGCCCCCCGG
GAGGACCTCC CGGTACTGCT CGCCGACGGC ATCCGCTGGG CGCGCGTCGC GCACGCGCGC
CGGGTCGTCC CGGGAGTGTG GCGGGCAGGG CTCGCTTACC TGCCGGTGCC GCTCCCGGAG
AGACCGGCGC ATGAATACCT TGCTGCCTAG
 
Protein sequence
MRVSTWLLLP ISVLLACGHG ERSAARDARA RAHDELSALW SFYRYSHVDR GRVVAHDEAG 
ITTSEGQSYA LLRAVWAGDR ATFDEVWRWT RENLQVRDDE LLAWKWKDRV LDRNAATDAD
QDTALALVLA SRRFGEARYL EEARRILADV WEREVIQAGG RFLPTGGNWA PRERYPTIHV
AYLAPYAYEV FAGVDPAHPW KELVRGSYEI LRFLYFDRGV VLPPERIWVD SRSGALLLDK
PGGGPAQTFG YDSVPLFWRV ALDAAWFGRR GEADLRARML AFPRETFRAE GRIRDRYTTG
GRALSELDGQ PHMASIAALA AVEDRGLASD LRSGRLDALF ERALAGEATP YYLHNWLWLG
RALELGVARR YDEPLAFLLG IDWGPFRERF PLVPVLAALA LAPLARRFTW ARAGLLGLAI
ALSARYLGWR ARETLNFVEP LGPVISLSLL AAEAYAFSTV LLLAVQVGVS RRRREPPAPL
APDEEVPSVD VFVPIYSESL EILDKTLAAC MAMRHPRKTV HVCDDSHREA VARLAAEHGA
RYVPGPKRHA KAGNLNNALS LTSGDLVVVF DTDHVPCAAF LERTVPHFRE PRMGVVQTPH
HFYNPDVFQR ALGTGPAVPN EADLFNHAIQ AGRDRWGGAF FVGSGAVFRR AAIASVGGFN
LLSITEDIHT SQKLHAKGWR SAFVDEDLAV GLSAENVQSY LVQRRRWMLG CLQIFFKDNP
LLERGLPLRH RVGYFASLWY FFFPLARVAF FLTPLWYLLF HLHPLFADLP VLLAYLAPHL
LLSTLAANAL LPGWPRLLWG TVYEAAVAFP LARATLDLLL PKSLGFKVTP KGIVSDRRRF
DASSSRLTLA AAALAGVALA KGAFELVAFG IEVEAYAFNL FWAGSNLAAV LAALLVAWER
PQRREDERVR RRVPVRVEAP ALSLCAETVD LSTTGAGLRL PASAALPPAV EVAFSAPEPL
RVRARVVWRE EVRGATRAGL AFEGLPAEAR RALVRIAFSS EEAFSGAHDA RTRSQLAMAL
QLLAGIARAF LPLHARRRLA PRRPSLRPLA WVGARGRVRG LVVDSSSGGF GALFLGRAPR
EDLPVLLADG IRWARVAHAR RVVPGVWRAG LAYLPVPLPE RPAHEYLAA