Gene Anae109_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1408 
Symbol 
ID5376444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1590422 
End bp1593214 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content72% 
IMG OID640842918 
Productglycosyl transferase family protein 
Protein accessionYP_001378599 
Protein GI153004274 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase
[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.370987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.266781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGGC ACTCGTGGCA AAGTGCGCAC TGCATGAGCG CTCCCGTGAC CTCTCCCCCT 
TTTGACCGGA CGAGTTCGCC AGGCTCCTCC CCTCGAAACG ATCGTCGCGA GGAGGCGACC
GCGCCCCGCG ATCGCGACCA CGCCAAGGGG CTCGAGGCGG CGCTCGCGGA GGCGCGGCTC
GCGCTGGTGC GGGAGCGCTC CCGCTCCCTC CAGCTCCGCC AGGATCTCCA GCGCGTCGAG
GAGCGGCTGC AGCGCACGGA GGCCGAGCTC GAGGGTATCC AGGCGAGCAC CGCGTGGAGG
GCGGCGCGGA TCTACTACCG TGTCCGGGAC GACTTCTGGC CGCTGCGGGT GGCGCACCGC
GCCGCCCGCA GGCTCAAGGC CCGGCTCGGC GGGCTTCGCA GCGCGGTCGC GCAGGCCCGC
TCGCTCCTGC CCAACGGCGG AGGCCAGCTG ACCGGCCTCA CCCTGGGGCG CCGCGACCGC
GTGAGCGTCG TGCTCCCGGT CTACAACCAG GCGTCGATGC TCGCCGAGTC CATCGAGAGC
GTCCTCGGCC AGACGTACCG CGATCTCGAG CTCATCGTGG TGAACGACGG GTCGACGGAC
GGCGTGGAGT CGATCCTCGA GCGCTTCGCC GACGACCCCC GCGTCGTCGT CGTCACTCAG
CCGAACCAGA AGCTCCCGTC CGCCCTCAAC AACGGGTTCG ACTTCGCCAC CGGCGAGTTC
TACACCTGGA CCTCGGCGGA CAACGTGATG CTGCCTCGCC AGCTCGAGGT GCTCGTGGGG
TATCTCCGCG CGCACCCGGA GCAGGCCATG GTCTTCTCCG ATTACCAGGC GATCGACGAT
CGCGGGCTTC CGCTCGACGA CGCCTCCTTC CGCCCGCAGA ACCAGGACCC GCACGATCGC
AGCCGGATGA GGCTCCCGCG CGAGGTCACG GTCGCGAACC TCCACCGAAG CGGCGACAAC
TTCATCGGGG CCAGCTTCCT CTATCGGCGC GACGCGGCCC GCGTCGTCGG CGCTTACGCG
GAGGACACCT TCGGCGGGGA GGACTACGAC TACTGGCTCC GGGTCCACGC GCTCTTCGGG
ATCGGCCACG TCGACGAGGT GCTCTACCGG TACCGCGTCC ACGACAACAC GCTCAACGCG
CGCGCGAGAG AGCTGCGCAT CGGGGAGAGC GTTCGCCGCC TGCTCGAACG GCACGGCGAG
CGGCTCGCGT TCCTCGCGTC CTCGCCGTCG TGGCTGCAGC TCGGCCTCGA GATCCCGGGG
GCGCGGCCCG CGGCCGCCGG CGAGACGGCC GACATCGTGG CCTACCCGCT GTCGCGCGCG
GGCGATCCCG CCCTGCGCGA GCCCGCGGAC GCGGGCCGCT CACTGCGCGT CTGCGTCGTG
GACCTGCCGC TCGACGAGGT GGACGAAGCG GTGGCCGCGC GTGCGGATCT GCTGTTCGTC
ATCGACCCGA TCGTCCAGGC GCACCTCGAT CGCGAGCTGC CGGGCCGCGC GTTCCTCCTC
GATCTCGCCC GCGATCCCGC GCTCGCCCGC CGGATCGCGA GCCTGCGCCT GTTCGAGAAG
CGCAGCGCGC ATCCGGGTCG CCAGCCGGCC GCCCGGGTCC TGCCCGTGCG CGAGCCGCTG
CGCGTCGGCT TCCAGGTCGA CGGGATGGAC CGCGGCGGGC TCGAGCAGAT CGTCTCGGAT
CTCGTTCGCA ACGTCGACCG GTCCCGAGTG CTTCCGACCC TGCTCGTTCA CGCCTCCGAC
TGCGGCACCG CCGGCCGCGC GCTGAGGGAG AACGGGCTGG AGGTCGTCGT CACGGGGCGC
GACGAGCGAC GGCTGGTCGA GGCGGTCCGG GAGCGAAATC TCCAGGTGGT GAACCTGCAC
CACAGCGTCG CCGGCCTCTC CGCCTATCGC GACCTCGGCG TAGCGACCGT GTACACCGTG
CACTCGAGCT ACGTGTGGCT GGATCCGCTC GCCCGGAGGG CGCGCGCCGA GCGGCTGCGC
TCCGTCGATC TGCATCTCGC CGTGTCGCCG CAAGTCGGGC GCTTCTTCGA GGAGACCTTC
GAGGTGGATC CCCGGCGGGT CCGCATCGTG CCGAACGGGC TCGATCCGAG CGAGCTGGAG
GACGCCCACC GGGCGTCCCG CGAGGGCCTG GGCCTCTCCG AGACGGACTT CGTCTTCCTG
CAGGTGGGGT CGTTCTCGCC CAACAAGCTG CAGCGCACCA CCGTCGAGGC GTTCGCCCGC
GTGTGCACCG GGGTGCCGGA GGCTCGGCTC GTCCTCGTCG GGAACGCCTT CGATCCCCGC
TACGCGCGTG AGGTCGAGGA CGCCGTCCGC GCCGGAGGAT TGACCGACCG CGTACGTATC
CTGCCATGGG GAAGTCGCGA TTCGATCGCA GGTCTCATGG GCGCCGCCGA CTGCTTCGTC
CTCCCATCGC TCGTCGAGGG GTGGAGCATC GCGGTCATGG AGGCCATGTA CTTCGGCCTC
CCGCTCGTCG TCTCCGACGT GGGATCGGCG CGGGAGGTGA TCCGCGACGG CGACATCGGG
ATCGTCATCC CGTGCCCCTA CGAGCGGTTG TCGGAGCTCA CCCTCGAGCA CCTCGTCGCG
CTCGGGGAGC GGCCGCCGGA GGAGTACGTC GAGGTGCTCG CCGCGGCGAT GCGCGAGGTG
GCGCTCCACA GGGACGTCTG GCGGGAGCGC GGCGCGCGTG GCCGTGACAA GGTGACCGGT
CCGTTCCACG TCCGCGCGAT GGCGGACGCG TACGCCCGCG CCTACGCCGA CGCGTACCGC
TGGCTGCGGA AGACCGGGGC GCGGCGCGCC TGA
 
Protein sequence
MDRHSWQSAH CMSAPVTSPP FDRTSSPGSS PRNDRREEAT APRDRDHAKG LEAALAEARL 
ALVRERSRSL QLRQDLQRVE ERLQRTEAEL EGIQASTAWR AARIYYRVRD DFWPLRVAHR
AARRLKARLG GLRSAVAQAR SLLPNGGGQL TGLTLGRRDR VSVVLPVYNQ ASMLAESIES
VLGQTYRDLE LIVVNDGSTD GVESILERFA DDPRVVVVTQ PNQKLPSALN NGFDFATGEF
YTWTSADNVM LPRQLEVLVG YLRAHPEQAM VFSDYQAIDD RGLPLDDASF RPQNQDPHDR
SRMRLPREVT VANLHRSGDN FIGASFLYRR DAARVVGAYA EDTFGGEDYD YWLRVHALFG
IGHVDEVLYR YRVHDNTLNA RARELRIGES VRRLLERHGE RLAFLASSPS WLQLGLEIPG
ARPAAAGETA DIVAYPLSRA GDPALREPAD AGRSLRVCVV DLPLDEVDEA VAARADLLFV
IDPIVQAHLD RELPGRAFLL DLARDPALAR RIASLRLFEK RSAHPGRQPA ARVLPVREPL
RVGFQVDGMD RGGLEQIVSD LVRNVDRSRV LPTLLVHASD CGTAGRALRE NGLEVVVTGR
DERRLVEAVR ERNLQVVNLH HSVAGLSAYR DLGVATVYTV HSSYVWLDPL ARRARAERLR
SVDLHLAVSP QVGRFFEETF EVDPRRVRIV PNGLDPSELE DAHRASREGL GLSETDFVFL
QVGSFSPNKL QRTTVEAFAR VCTGVPEARL VLVGNAFDPR YAREVEDAVR AGGLTDRVRI
LPWGSRDSIA GLMGAADCFV LPSLVEGWSI AVMEAMYFGL PLVVSDVGSA REVIRDGDIG
IVIPCPYERL SELTLEHLVA LGERPPEEYV EVLAAAMREV ALHRDVWRER GARGRDKVTG
PFHVRAMADA YARAYADAYR WLRKTGARRA