Gene Gura_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1870 
Symbol 
ID5166871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2170404 
End bp2171471 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content57% 
IMG OID640549361 
Productthreonine aldolase 
Protein accessionYP_001230633 
Protein GI148263927 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.163709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTG CCGATCTTCA AAAACCGCTG CAGCATCATC AGTTTGCCAG CGACAATTAC 
GCCGGGATCT GCCCGGAAGC AATGCAGGCG ATGGCGGAAG CAAATCGCGG CTACGCCTCA
TCGTACGGAG ACGACTACTG GACCGGCAAG GCCTGCGAAC GGCTGCGGGA GCTCTTTGAG
ACCGACTGCG AGGTGTTTTT CGTCTTTAAC GGCACGGCAG CCAACTCTCT GGCGCTTGCT
TCGCTCTGCC AGTCCTATCA CAGCATCATC TGCCACGAAA TGGCGCACAT CGAAACCGAC
GAGTGCGGCG CTTCCGAGTT TTTCTCCAAC GGCACCAAGG TTTTGCTGGT GCATGGTGAA
AACGGAAAGG TCGATCTCGG AGAAATTGAA CATACGGTTC AGCGCCGCAC GGACATCCAC
TATCCGAAAC CGCGTGCGCT GAGTATAACC CAGGCAACGG AACTGGGCAC GGTCTACACC
GTTGATGAAA TGCAGGCGAT CGGCGAGGTT GCCAGGCGTT TTTCTCTGCG GATTCATATG
GACGGAGCCC GTTTCGCCAA TGCCATAGCG TCCTTGAACG TCGCACCGAA GGAAATCACA
TGGAAAGCGG GGGTGGATGT GCTCTGTTTC GGCGGAACGA AAAACGGCTT CGCCATGGGC
GAAGCCGTCA TTTTCTTTAA CCGCGAACTG GCATTCGAGT TCGACTACCG TTGCAAACAG
GCTGGGCAAC TCGCCTCAAA GATGCGCTAC CTTGCCGCTC CATGGATCGG CACCCTGGAA
AGCGGCGCCT GGCTGCGTCA TGCTGCCCAT GCCAATGCCT GCGCCCGGAA GCTGGAAAAA
GAGCTGCAAT CCATTGCCGG CATCAGAGTC ATGTTCCCCT GCCAGGCAAA CTCCGTATTT
CTGGAGATGC CGCCAACGCT GATGGAAGCG CTGCGCAACC GCGGCTGGCA CTTTTACACC
TTCATCGGCT CTGGAGGCGC CCGCTTCATG TGCTCATGGG AAACCAGCGA TGCGGACATC
GCCGCTCTGG TGAAGGACAT CCGCGAACTG GTGCAACAGA ATACCTAG
 
Protein sequence
MKRADLQKPL QHHQFASDNY AGICPEAMQA MAEANRGYAS SYGDDYWTGK ACERLRELFE 
TDCEVFFVFN GTAANSLALA SLCQSYHSII CHEMAHIETD ECGASEFFSN GTKVLLVHGE
NGKVDLGEIE HTVQRRTDIH YPKPRALSIT QATELGTVYT VDEMQAIGEV ARRFSLRIHM
DGARFANAIA SLNVAPKEIT WKAGVDVLCF GGTKNGFAMG EAVIFFNREL AFEFDYRCKQ
AGQLASKMRY LAAPWIGTLE SGAWLRHAAH ANACARKLEK ELQSIAGIRV MFPCQANSVF
LEMPPTLMEA LRNRGWHFYT FIGSGGARFM CSWETSDADI AALVKDIREL VQQNT