Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1796 |
Symbol | |
ID | 2686344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 1961541 |
End bp | 1962641 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637126483 |
Product | DHH family protein |
Protein accession | NP_952846 |
Protein GI | 39996895 |
COG category | [R] General function prediction only |
COG ID | [COG0618] Exopolyphosphatase-related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00640097 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGAAG GTTGTTCGCT TGCAAAGACG AAGAGTTACA CGGACGAAAT GCTCAACTGG GTGAGGGGCA AGGGCAAGAT CCTAATCGTT GTTCACGACA ATCCGGACCC CGATTGTCTC GCCTCTGCCA TAGCGCTGCG GCATCTGTTC GTTATGAAGC TGAACAGGGA TGCCACAATT GCTTTTTCCG GCATGATCGG CAGGAGCGAG AACATCGCCA TGGCCAAGCA ACTGCAAATC CCCCTGACCC CGCTGGCCCT CATCGATTAT CAGGATTTTT CCGTTACGTG CATGCTCGAT ACCCAGCCGG GGGCGGGCAA TAATTCACTC CCCCCTGACA AGCGGGTTGA TATTCTTATT GATCATCACC CCAGACGGGA AGCCGGCTTG AGATGCCGAT GGGATGATAT CCGCGAAGAA TACGGCGTAA CTGCAACCAT CGTCTATGAA TATCTGCAGG CCCAGGATGT CCCTATTGGC AGCAATCTCG CCACGGCGCT GTTTTATGCG ATTAAGTCGG AAACACAGGA TCTTGGTCGC GAGGCCGCCC GACCCGATAG AGATGCCTAT CTGAAGCTTT TCCCCCTGGC CAACAAAAAA CTGCTCTATG AAATCACCTA TCCAAAGCTT CATGTGGAAT ATTACCTCAC GATCAACCGC GCTATTGAAC ATTCACGTAT CTATGGAAAT CTGCTGGCAA CAAATCTTCA AGAAGTAAGT TTTCCGGAAA TTGTCGCCGA GATGGCGGAT TTTTTTCTCC GGCTTGAAGG GGTCGATGTT GTGCTGGCGA TTGGCCGATT TCACAACGAA ATCATTCTTT CAGTGCGCAC CTCCCGGCAC GACGTTAATG CAGGAACGTT GATCAAGCGG CTCGTCGAGG GCAGAGGCGC CGCTGGAGGC CATGGCATGA TGGCGGGGGG GAAAATCGAA GGACTTTCCG ATTCCGCTGC GGAACTGGAG GATATCGAGC AACTCCTCGT CCGTCGTTTT ATCGATATCT TCGCGCTGGG GCACATTCGG TCGCTGTCGT TGAGTGCTCT CAGGCGTACG GCTCTGCCGG CCCTTGCCGC CGAGATTGAC AACCTGCATC ACATCGTTTA A
|
Protein sequence | MGEGCSLAKT KSYTDEMLNW VRGKGKILIV VHDNPDPDCL ASAIALRHLF VMKLNRDATI AFSGMIGRSE NIAMAKQLQI PLTPLALIDY QDFSVTCMLD TQPGAGNNSL PPDKRVDILI DHHPRREAGL RCRWDDIREE YGVTATIVYE YLQAQDVPIG SNLATALFYA IKSETQDLGR EAARPDRDAY LKLFPLANKK LLYEITYPKL HVEYYLTINR AIEHSRIYGN LLATNLQEVS FPEIVAEMAD FFLRLEGVDV VLAIGRFHNE IILSVRTSRH DVNAGTLIKR LVEGRGAAGG HGMMAGGKIE GLSDSAAELE DIEQLLVRRF IDIFALGHIR SLSLSALRRT ALPALAAEID NLHHIV
|
| |