Gene Anae109_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4084 
Symbol 
ID5378145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4777394 
End bp4778461 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content75% 
IMG OID640845611 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001381246 
Protein GI153006921 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.129655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGAT CTCTCGACGA CAGCGGCTCG GCGCCGGGGG CGAGCGGAAG AGTGCTCGTC 
ACCGGCGCGA CCGGTTTCCT CGGGGCGAAC GTGGCGCGGC TCCTGCTCGA GCGGGGCGTG
GAGGTGCGCG CGCTCGTCCG GGCGTTCTCC CCGCGCACGA ACGTGGACGG GCTCCCGATC
GAGCTCGTCG AGGGAGACCT CCGCGACGCG GAGGCGGTGC GCCGCGCGGT GCGCGGCTGC
CGGCGGGTGT TCCACGTCGC CGCGGACTAC CGCTTCTGGG CGCGCGATCC GCGCGAGCTC
TACGCGTCGA ACGTCGAGGG CACGGTGCAC GTCATGGAGG CGTGCCTCGC CGAAGGCGTC
GAGCGGGTGG TCTACACCTC CACGGTCGGC ACCATCGGTC TCGCGGCGGC GCCCGCGCCC
TGCGACGAGC ACACGCCGCT CGTGGCGGGG CAGCTCACGA GCCACTACAA GCGCTCGAAG
CTCGAGGCGG AGCGGGCGGC CCTCTCCTAC GTCGCGCGCG GCCTCCCCGT CGTGGTGGTG
AACCCGTCCG CGCCGGTCGG CGCCTGGGAC GTGAAGCCGA CGCCGACCGG GCGCATCCTC
CTGGACTTCG CGCTCGGGAA GCTCCCCGCC TTCGTGGACA CGGGCCTGAA CGTGGTCCAC
GCGCGCGACG TGGCGGAGGG GCACCTGCTC GCCGCGGCGC GCGGCCGCGT CGGCGAGCGC
TACATCCTCG GCCACCGGAA CATGACGCTC GCGGAGATCC TCGCCGAGGC GGGGGCGATC
CTGGGCCGTC CGGCGCCGCG CCTGCGGCTC CCGTACGCGG CCGCGCTCGC GGTGGGAGCG
CTCGACACCG CGCTCTCCCG CCTCACGCAC CGGCCGCCCA CGGTGGCGCT GGAGGCGGTG
CGCATGTCGC GGCGCCGCAT GTTCTTCGAC GCCGGCAAGG CGGTGCGCGA GCTCGGGCTG
CCGCAGACGC CGGTCCGCCG GGCCTTCGAG GACGCGATCG CGTGGTTCGC CGAGCGGGGC
TATCTCGCGG GGGCGGGGCA GGGGAGGACG GCATGGGCAT CCCGCTGA
 
Protein sequence
MTGSLDDSGS APGASGRVLV TGATGFLGAN VARLLLERGV EVRALVRAFS PRTNVDGLPI 
ELVEGDLRDA EAVRRAVRGC RRVFHVAADY RFWARDPREL YASNVEGTVH VMEACLAEGV
ERVVYTSTVG TIGLAAAPAP CDEHTPLVAG QLTSHYKRSK LEAERAALSY VARGLPVVVV
NPSAPVGAWD VKPTPTGRIL LDFALGKLPA FVDTGLNVVH ARDVAEGHLL AAARGRVGER
YILGHRNMTL AEILAEAGAI LGRPAPRLRL PYAAALAVGA LDTALSRLTH RPPTVALEAV
RMSRRRMFFD AGKAVRELGL PQTPVRRAFE DAIAWFAERG YLAGAGQGRT AWASR