Gene Anae109_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0783 
Symbol 
ID5376254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp891822 
End bp892799 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content77% 
IMG OID640842293 
ProductHhH-GPD family protein 
Protein accessionYP_001377979 
Protein GI153003654 
COG category[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.336005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCTTT TCCGCGAGGA GCCCCCGCAA CGCCCTCGCG GGGACGACGT GCGAGTACCC 
GAGCCCTTCG ACCTCGACCT CACGGTCCGC AGCCATGGCT GGTACGACCT GCCGCCGTGG
CGATACGACC CGGCGCGCCG CGTGCTCGGC CGCCCGCTCC TCCTCGCGGG AGGGAGGACG
GTCTACGCGG AGGTCGCGGC GGGGCAGGGC GGGCTCGCGT TCCGCGCGCT GGCCGAGGGG
CGGCTCGGCC CGGCCGAGGC GCGCGCCGCG CGGGCGGCGA TGCGGACCTG CCTGTCGCTC
GACGAGGATC TCTCCGGGTT CCACGCCCGG GCCGCCGCGC TCGAGGCGCG CCGCGCGGAG
GGCCGGGCGA AGGACCTGCC CGATCTCCGC TGGGCGCTGG CGCGCGGCGC GGGACGGCTG
CTCCGCTCCC CGACCGTGTT CGAGGACGCG GTGAAGACCC TCTGCACCAC CAACTGCTCA
TGGGCGCTCA CGCGCGCGAT GGTCTCGCGC CTGTGCGACG CGCTCGGCGC GACCGCGCCG
CTCGGTACGC GGGCGTTCCC CACCCCCTCC GCGATGGCGT CGATGCCGGA GCGCTTCTAC
CGGGACGAGA TCCGCGCCGG GTACCGCGCT CCCTTCCTCG CGGCGCTCGC CCGCGACGTC
GCCTCGGGCG CCCTGGACCT CGAGGGGCTC CGCGGGACGG CGGAGCCGAC CGACGCGCTC
GCGCGGCGCA TTCGCGCCCT CGCCGGGTTC GGCCCGTACG CGACCGAGCA CCTCCTGCGG
CTGCTCGGCA GGCACGATCA CCTCGCGCTC GACGCCTGGA CCCGCCCGGC CCTCGCGCGG
CTGCGCGGGC GCCGCCGCGT CCCGACGGAT CGGGCCCTGC GCCGCTGGTA CGCGCCGTAC
GGGGAGTTCG CGGGCCTCGC GATGTGGCTC GAGGTCACCG CCGACTGGCA CGACGGTCTG
CCGGCCTGGC CGCCCTGA
 
Protein sequence
MLLFREEPPQ RPRGDDVRVP EPFDLDLTVR SHGWYDLPPW RYDPARRVLG RPLLLAGGRT 
VYAEVAAGQG GLAFRALAEG RLGPAEARAA RAAMRTCLSL DEDLSGFHAR AAALEARRAE
GRAKDLPDLR WALARGAGRL LRSPTVFEDA VKTLCTTNCS WALTRAMVSR LCDALGATAP
LGTRAFPTPS AMASMPERFY RDEIRAGYRA PFLAALARDV ASGALDLEGL RGTAEPTDAL
ARRIRALAGF GPYATEHLLR LLGRHDHLAL DAWTRPALAR LRGRRRVPTD RALRRWYAPY
GEFAGLAMWL EVTADWHDGL PAWPP