Gene Anae109_1270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1270 
Symbol 
ID5375130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1438828 
End bp1439856 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content72% 
IMG OID640842781 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_001378462 
Protein GI153004137 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAGG AGCTGGAGGA GCAGACGCTC CACCCGCGCG CCGCGCGCGC GGCGCGGTCC 
CGCGGCCGGG AGCGGCGAGA GCCCGAGGAC GACGAGCGGC CGTCGTTCCA GCGCGACCGC
GACCGGCTCG TCCACTCCAA GGCCTTCCGG CGGCTCGCCG GCAAGACGCA GGTCTTCCTC
GCGCCCCGCG GCGATCACTA CCGGACCCGG CTCACCCACA CGCTCGAGGT CGCCCAGGTC
GCCCGGTCGA TCGCCCGCGC GCTGCGGCTC AACGAGATGC TGGTCGAGGC GATGGTCATG
GGCCACGACC TCGGGCACAC GCCGTTCGGG CACGCCGGCG AGCGGATCCT GAACGAGGTC
CTGCCGGGCG GGTTCCACCA CGTCATCCAG TCGGTGCGGG TCGTCGACGT CCTCGAGAAC
GACGGGCACG GGCTGAACCT CACCGCCGAG GTGCGCGACG GCATCCTGCG CCACTCCAAG
GGGAAGGGGA ACGTGCTGCT CAAGGGGAGC GGCGCGAAGG CGCTGACCCT CGAGGCCGAG
ATCGTTCGGC TCGCCGACAT CATCGCCTAC GTGAACCACG ACCTCGACGA CGCGCTGCGC
GCCGGCCTGT TCACCGAGGC GGACGTCCCG GCCTCCATCC GCGCGGTGCT CGGCGGCGGG
CCGACCCCGC GCTACCGCAC CCTCATCCGC GACGTGATCC GCCGCTCCGA CGTCGACGGC
GGGGGGCACA TCGAGATGTC GCCGGACGTG CACGAGGCGC TGCTCGCGCT GCGCGACTTC
CTCTATGCGC GCGTGTACGA GAACCCCGTC GTGCACGACG AGTTCGTGAA GACGCAGCGG
ATACTGCGCG ACCTGTACGG CTGGTGCCTC GAGGACGCGG CGCGGCTGCG CGAGCGTCAC
GGCGTGGTCG GGCGCGAGGG TGACCCGCCC GAGCGCACGG CGACGGACTG GCTCTCCGGC
ATGACCGATC GCTTCGCCAT CGGGGTCTGG GAGGCGATCT TCGTGCCGCG GCCGTGGAGC
GTGGTGTGA
 
Protein sequence
MLEELEEQTL HPRAARAARS RGRERREPED DERPSFQRDR DRLVHSKAFR RLAGKTQVFL 
APRGDHYRTR LTHTLEVAQV ARSIARALRL NEMLVEAMVM GHDLGHTPFG HAGERILNEV
LPGGFHHVIQ SVRVVDVLEN DGHGLNLTAE VRDGILRHSK GKGNVLLKGS GAKALTLEAE
IVRLADIIAY VNHDLDDALR AGLFTEADVP ASIRAVLGGG PTPRYRTLIR DVIRRSDVDG
GGHIEMSPDV HEALLALRDF LYARVYENPV VHDEFVKTQR ILRDLYGWCL EDAARLRERH
GVVGREGDPP ERTATDWLSG MTDRFAIGVW EAIFVPRPWS VV