Gene Anae109_4199 details


Gene Information

Locus tag: Anae109_4199
Symbol:
ID: 5375214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4924378
End bp: 4926183
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 73%
IMG OID: 640845726
Product: glycosyl transferase family protein
Protein accession: YP_001381361
Protein GI: 153007036
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
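
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4924378
END_BP = 4926183


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1806
print(protein_length(length_bp))  # 601
```

Both results match the Gene Length and Protein Length fields of the record.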


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.458284
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATCC CGACTCAGCC GCTTCGCCGC GAGCCGACCG CCGGCCCGCG GCCCGCGGCC 
ACGACGGCGG ACGAGCCGTG GTCCTCCGCC GAGCGGCGCT GGTACGCGGG CGCGATGGCG
CTCGCCCTGC TCGTCATGGC GGCGGGGCTC GTCTTCCCGG ACCTCATGGG CGGCGACGCC
GCGCAGGACG CGGTGATGGC GCTGCGGATG TACCTCGCCG ACGACTGGGT CAACCTCGTC
AAGAACGGGC GCGACTACCT CGACAAGCCG CACCTCCTCT TCTGGTCGGC GCGGGCGAGC
TACGAGCTGT TCGGCGTGCA CGACTGGGCC TACCGGCTCC CGTCGGCCCT GGCCTCCCTG
CTCGGCGCGT GGGCGGCCTA CGGCATGGCG AGGCGCCTCC ACGGCGAGAC GGCCGGCCGG
CTCGCGGCGC TCATGGTGGT CACCGCGTAC GCGATCGTGC TCGGCAATCA CGACGTCCGC
ATGGACGCCC TGCTCATGGG CTTCACCGCC TTCGGGACCT GGCAGCTCCT CGAGTACCTG
GAGACCGGCC GCGCCCGAGC GGCGGTCCTC GGCGGCGCCG GCGTGGCGCT CGGGGTCTCC
GCGAAGGGCA TGGTCGCGGT GGCGGTGAGC GGCTGCGTGC TCTTCTTCTA CGTGTGGGGC
CGCGGCCGCT GGCGGCGGCT GTGGAGCTGG AAGATCGCGC TGGGGATCGC CGTGTTCGTC
CTCGCGCTCT CGCCGGTGCT CTTCGCCTAC TACCAGCAGT ACGATCTCCA TCCCGACAAG
GTCGTGAACG GTCGCACCGG CGTCTCGGGC GTGAAGTTCA TCCTGCTCGG GCAGAGCCTG
GAGCGCTTCG GCGGGGGCCG CGGGCACAAG ATCGCCGACG ACCACCTCTT CTTCTTCCAC
ACGCTGGCCT GGGCGTTCCT GCCCTGGAGC CTCCTGACCT ATGCCGCGTG GGCCGAGCGG
TTCCGGGAGC TGTTCCGGCG GCGCTGGGCC GCGTTCCGTG AGCGCGAGCA GCTCACCTTC
CTGGGCCCGT TCGCGTTCCT CGCCGTCCTG GGCTTCTCGC AGTTCAAGCT GCCCCACTAC
CTGAACGTCG TGCTGCCCTT CCTCGCCGTG TTCACGGCGA GCTACCTCGC CGACCTGCGC
CGCGAGGGCC GGCTGCGGGC GCTCGCGCGT CTCCGGTGGG TGCAGCTCGT CGTCATCGCC
GCGCTCCTCG CGCTCGTCGT GGTCCTGAAC GCGTGGGCGT TCCCGGTCGA GCGCGCCTGG
ATCGTGCTCG CCGCGCTCGC GCTGCTCGCC GTCCTCGTCG CGAGCCTCCG CGTCCGCGAG
CCGCTCGCGC GCGTGTGGGC GCCCTCCGCC GTCGCCATCC TCCTCGCGGA GCTCCTGGCG
AACACCAGCT TCTACCCGCG CCTCGGCCGC TACCAGCCGG GGAGGGACCT CGCGGCCGCC
GCGGAGGCGA GCGGCGTCGA CTGGGAGCGC ACGTTCTTCC TGGAGACCGT CTACCAGCCG
TTCCAGTTCT ACGCGGGGCG TGTCATCCCG CAGCTGGACT TCGCCGGCCT GCACCGCGAG
GTCGCCGCCG GACGAGAGCT CTTCCTCGCC GTGTCCGCGG AGGAGGAGCG CCGCCTGCGC
GACGAGGGCA TCCCGCACGA GGTGCTCGCC ACGAGCCCGA GCTGCCGGGT CCTCAACCTC
ACGGGAAAGT TCGTGAACCC GCGCACCCGC GACGGCACGT GCAAGACGGT GTTCCTGGTC
GCCGCGGGGG CGAGCGCCCC GGACCACCGG CGCGTCGAGC CCGGTCGAAC CGGCGACCGT
CCGTAG
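
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the GC content field. The helper names are hypothetical, and the example runs on the first sequence line only, so its result need not equal the 73% reported for the full gene.

```python
def clean_sequence(text):
    """Keep only A/C/G/T characters, dropping spaces and line breaks."""
    return "".join(ch for ch in text.upper() if ch in "ACGT")


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGGATCC CGACTCAGCC GCTTCGCCGC GAGCCGACCG CCGGCCCGCG GCCCGCGGCC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the figure to compare against the GC content field.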
 
Protein sequence
MRIPTQPLRR EPTAGPRPAA TTADEPWSSA ERRWYAGAMA LALLVMAAGL VFPDLMGGDA 
AQDAVMALRM YLADDWVNLV KNGRDYLDKP HLLFWSARAS YELFGVHDWA YRLPSALASL
LGAWAAYGMA RRLHGETAGR LAALMVVTAY AIVLGNHDVR MDALLMGFTA FGTWQLLEYL
ETGRARAAVL GGAGVALGVS AKGMVAVAVS GCVLFFYVWG RGRWRRLWSW KIALGIAVFV
LALSPVLFAY YQQYDLHPDK VVNGRTGVSG VKFILLGQSL ERFGGGRGHK IADDHLFFFH
TLAWAFLPWS LLTYAAWAER FRELFRRRWA AFREREQLTF LGPFAFLAVL GFSQFKLPHY
LNVVLPFLAV FTASYLADLR REGRLRALAR LRWVQLVVIA ALLALVVVLN AWAFPVERAW
IVLAALALLA VLVASLRVRE PLARVWAPSA VAILLAELLA NTSFYPRLGR YQPGRDLAAA
AEASGVDWER TFFLETVYQP FQFYAGRVIP QLDFAGLHRE VAAGRELFLA VSAEEERRLR
DEGIPHEVLA TSPSCRVLNL TGKFVNPRTR DGTCKTVFLV AAGASAPDHR RVEPGRTGDR
P
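
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the conventional compact amino acid string and uses the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). This gene starts with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(ch for ch in dna.upper() if ch in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence in the record.
print(translate("ATGAGGATCC CGACTCAGCC GCTTCGCCGC"))  # MRIPTQPLRR
print(translate("CCGTAG"))                            # P
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.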