Gene Anae109_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3220 
Symbol 
ID5375984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3773526 
End bp3774674 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content78% 
IMG OID640844743 
Productglycosyl transferase group 1 
Protein accessionYP_001380399 
Protein GI153006074 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC ACGATGAGGC ACCCCGGGGC GCGGCCCGCG GGCCGCGCGT CCTCCTGACG 
GCGGACACCG TGGGCGGCGT CTTCGCCTAC GCGGCGGAGC TCTGCGCCGC GCTCGCGCAG
CGCGGCTGCG CCGTCGCGCT CGCGACGCAG GGGCGCCCGC TCTCCGCGGA CCAGCGGGCG
GAGCTGGCGC GTGTTCCCGG CCTCGAGGTG CACGAGGGGA CGGGCAAGCT CGAGTGGATG
CAGGATCCCT GGGACGACGT CGCGCGGGAG GGAGAGCGGC TCCTCTCGAT CGCGCGGCGG
TTCCGGCCGG ACGTCGTGCA CCTCAACGGC TACGCGCACG GCGCGCTGCC GTTCGGGGTG
CCGAAGGTGG TCGTGGCCCA CTCCTGCGTC CTCTCCTGGT TCGAGGCGGT GCGCCACGCG
CCCGCGCCGC CGTCGTTCGA CCGCTACCGC CTCGAGGTGC GCCGCGGGCT CGACGGCGCC
GACGCGGTGG TGGCGCCGAC GCGGGCCATG CTCCGCGCCC TCGAGCGCCA CCACGGCAAG
GTCCGGCGCG GGCTCGTCAT CGCGAACGGC CGCGCCCCCG AGCGCTACCC GCCGCGCCCG
AAGGAGCCGT TCGTGCTCTG CGCCGCGCGG CTGTGGGACG AGGCGAAGGG GGCAGCCACG
CTCGACGCGG CCGCAGGCCG CCTCGCCTGG CCGGTGCTGC TCGCAGGCGA TGAGATGAGC
CCGGACCTCG CCCACCCCGG CGCCTCGGCG CCGCGCCACG CGCGCCCCCT GGGACGGCTC
GCGCCCGACG CGCTCGCCGT GTGGTACGGC CGCGCCTCGA TCTATGCGCT CCCCGCACGG
TACGAGCCCT TCGGCCTCTC CGCGCTGGAG GCGGCGCTCG CCGGCTGCGC CCTGGTGCTC
GGCGACGTCC CGAGCCTCCG TGAGGTCTGG CTCGGCGCCG CGGCCTTCGT CCCGCCCGGG
GACGTGGAGG CGCTCGCCTC GACGCTCGCG GGGATCGTCC ACGACGCGCC GGCGCGCGCC
GAGCTCGGGC GCCAGGCGCG GCGGCGCGCC TTGCTCTTCG GGCGCGAGCG GATGGCGGAG
CGCTACCTCG CGACGTACGG CGGCCTGCTC GCGCGGGCGG AGGAGGGGGC GGTCCCGTGC
GCGTCGTGA
 
Protein sequence
MSGHDEAPRG AARGPRVLLT ADTVGGVFAY AAELCAALAQ RGCAVALATQ GRPLSADQRA 
ELARVPGLEV HEGTGKLEWM QDPWDDVARE GERLLSIARR FRPDVVHLNG YAHGALPFGV
PKVVVAHSCV LSWFEAVRHA PAPPSFDRYR LEVRRGLDGA DAVVAPTRAM LRALERHHGK
VRRGLVIANG RAPERYPPRP KEPFVLCAAR LWDEAKGAAT LDAAAGRLAW PVLLAGDEMS
PDLAHPGASA PRHARPLGRL APDALAVWYG RASIYALPAR YEPFGLSALE AALAGCALVL
GDVPSLREVW LGAAAFVPPG DVEALASTLA GIVHDAPARA ELGRQARRRA LLFGRERMAE
RYLATYGGLL ARAEEGAVPC AS