Gene GSU1796 details

Gene Information

Locus tag: GSU1796
Symbol:
ID: 2686344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1961541
End bp: 1962641
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 53%
IMG OID: 637126483
Product: DHH family protein
Protein accession: NP_952846
Protein GI: 39996895
COG category: [R] General function prediction only
COG ID: [COG0618] Exopolyphosphatase-related proteins
TIGRFAM ID:
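
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length:

```python
# Values copied from the record.
start_bp = 1961541
end_bp = 1962641

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1101 366"
```

Under these assumptions the computed values agree with the reported gene length (1101 bp) and protein length (366 aa).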


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00640097
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGAAG GTTGTTCGCT TGCAAAGACG AAGAGTTACA CGGACGAAAT GCTCAACTGG 
GTGAGGGGCA AGGGCAAGAT CCTAATCGTT GTTCACGACA ATCCGGACCC CGATTGTCTC
GCCTCTGCCA TAGCGCTGCG GCATCTGTTC GTTATGAAGC TGAACAGGGA TGCCACAATT
GCTTTTTCCG GCATGATCGG CAGGAGCGAG AACATCGCCA TGGCCAAGCA ACTGCAAATC
CCCCTGACCC CGCTGGCCCT CATCGATTAT CAGGATTTTT CCGTTACGTG CATGCTCGAT
ACCCAGCCGG GGGCGGGCAA TAATTCACTC CCCCCTGACA AGCGGGTTGA TATTCTTATT
GATCATCACC CCAGACGGGA AGCCGGCTTG AGATGCCGAT GGGATGATAT CCGCGAAGAA
TACGGCGTAA CTGCAACCAT CGTCTATGAA TATCTGCAGG CCCAGGATGT CCCTATTGGC
AGCAATCTCG CCACGGCGCT GTTTTATGCG ATTAAGTCGG AAACACAGGA TCTTGGTCGC
GAGGCCGCCC GACCCGATAG AGATGCCTAT CTGAAGCTTT TCCCCCTGGC CAACAAAAAA
CTGCTCTATG AAATCACCTA TCCAAAGCTT CATGTGGAAT ATTACCTCAC GATCAACCGC
GCTATTGAAC ATTCACGTAT CTATGGAAAT CTGCTGGCAA CAAATCTTCA AGAAGTAAGT
TTTCCGGAAA TTGTCGCCGA GATGGCGGAT TTTTTTCTCC GGCTTGAAGG GGTCGATGTT
GTGCTGGCGA TTGGCCGATT TCACAACGAA ATCATTCTTT CAGTGCGCAC CTCCCGGCAC
GACGTTAATG CAGGAACGTT GATCAAGCGG CTCGTCGAGG GCAGAGGCGC CGCTGGAGGC
CATGGCATGA TGGCGGGGGG GAAAATCGAA GGACTTTCCG ATTCCGCTGC GGAACTGGAG
GATATCGAGC AACTCCTCGT CCGTCGTTTT ATCGATATCT TCGCGCTGGG GCACATTCGG
TCGCTGTCGT TGAGTGCTCT CAGGCGTACG GCTCTGCCGG CCCTTGCCGC CGAGATTGAC
AACCTGCATC ACATCGTTTA A
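
The GC content reported for this gene can be recomputed from the sequence. The sketch below is illustrative only: `gc_content` is a hypothetical helper, not part of any database tooling, and it strips the whitespace that separates the 10-base blocks before counting.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence (60 bases in blocks of 10).
first_row = "ATGGGAGAAG GTTGTTCGCT TGCAAAGACG AAGAGTTACA CGGACGAAAT GCTCAACTGG"
print(round(gc_content(first_row), 1))
```

The value for a single row will differ from the whole-gene figure; passing the full 1101 bp sequence gives the number to compare against the reported 53%.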
 
Protein sequence
MGEGCSLAKT KSYTDEMLNW VRGKGKILIV VHDNPDPDCL ASAIALRHLF VMKLNRDATI 
AFSGMIGRSE NIAMAKQLQI PLTPLALIDY QDFSVTCMLD TQPGAGNNSL PPDKRVDILI
DHHPRREAGL RCRWDDIREE YGVTATIVYE YLQAQDVPIG SNLATALFYA IKSETQDLGR
EAARPDRDAY LKLFPLANKK LLYEITYPKL HVEYYLTINR AIEHSRIYGN LLATNLQEVS
FPEIVAEMAD FFLRLEGVDV VLAIGRFHNE IILSVRTSRH DVNAGTLIKR LVEGRGAAGG
HGMMAGGKIE GLSDSAAELE DIEQLLVRRF IDIFALGHIR SLSLSALRRT ALPALAAEID
NLHHIV
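
The protein sequence can be derived from the gene sequence with translation table 11. The following is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, it assumes the input contains only A, C, G and T, and it ignores the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Codon assignments for NCBI translation table 11, with bases ordered T, C, A, G.
# Table 11 uses the same codon-to-amino-acid mapping as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
first_row = "ATGGGAGAAG GTTGTTCGCT TGCAAAGACG AAGAGTTACA CGGACGAAAT GCTCAACTGG"
last_row = "AACCTGCATC ACATCGTTTA A"
print(translate(first_row))  # prints "MGEGCSLAKTKSYTDEMLNW"
print(translate(last_row))   # prints "NLHHIV"
```

Both outputs match the start and end of the protein sequence listed in this record; the final TAA codon is the stop and is not translated.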