Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_4370 |
Symbol | |
ID | 5166956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 5062035 |
End bp | 5063198 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640551852 |
Product | cupin 2 domain-containing protein |
Protein accession | YP_001233086 |
Protein GI | 148266380 |
COG category | [S] Function unknown |
COG ID | [COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit [COG1917] Uncharacterized conserved protein, contains double-stranded beta-helix domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA AATTCACCAT ACTTCTGGCA AGTACACTTG CCGCAATATT CAGCATCGTA ACCTTTTCGG AGGCTCAGAC CGTGACAACA GGAGGTTTGA ACGCCAAGCA GGAAAACATT GTCACCATTG CGGCCTTTAC TGCCAGCGGC GACTTGCAAA AACTGAAAAC GGCCTTAAAC GATGGGCTGG ATGCCGGTTT GACCATCAAC GAAATCAAGG AAATCCTTGT GCAGATGTAC GCCTATGCCG GGTTCCCCCG CAGTCTGAAC GGGATCAATA CCTTCATCGG CGTCCTGGAA GAACGGGACA AGAAAGGGAT CAAGGATGTT CCCGGTAAGG AACCAAGCTC CATACCTGCC AACAGGAGCA GCATCGAACT TGGGACCGAA ATCCAGACAC ACCTGATAGG AGCGCCGGCC ACCGGGAAGT ATATTACCTT TGCCCCGGCC ATTGATGCGT TCTTGAAGGG ACACCTATTC GGGGACATCT TCGGGCGTGA CAACCTGGAT TACCAGAGCA GGGAGCTGGC AACCATTTCA GCCTTGGCGA GTATCGAGGG GGTCAATCCT CAGTTGCAGT CACATTTTAA CGTCGGACTG AATACCGGAC TGACCGAGGC GCAACTGCGG AGCCTGATAA CCGTTCTCGA AGCAAACGTT GGTAAAAAGG AAGCCGCAAA TGCTAGTGAA ACATTGGGTA AAGTTCTGAG TAACAGGCAG GCGGAGCAGA GAATCACTAT CGCTCGGAGC GGCTCCCTGC CTTCAAGCCA AGGTTCAGCC GAATACTTTT CAGGTTCCGT AAAAATCGAC ACGCTATTCA AAGCGCACGA ACCGGCACGT ACGACAGGCG GACTTGTCAC GTTCCAACCG GGTGCCCGGA CGGCGTGGCA CTCCCATCCG CTCGGCCAGA CTTTAATCGT GACAGCGGGC ACCGGCCGAA TACAACAGTG GGGTGGCCCG ATTGAGGAGA TCAGGCAGGG TGATGTCGTA CGGATTCCGC CCGGCGTAAA ACATTGGCAC GGAGCCGCGC CAAACACAGC CATGACTCAT ATCGCCATAG CAGAACAGCT TAATGGCAAT GCCGTCGAAT GGCTGGAAAA GGTTAGTGAC GAGCAGTACA ACCAACTGTC GTCTACACGA AAAAGGAGAA ACACATATGA GTAA
|
Protein sequence | MNKKFTILLA STLAAIFSIV TFSEAQTVTT GGLNAKQENI VTIAAFTASG DLQKLKTALN DGLDAGLTIN EIKEILVQMY AYAGFPRSLN GINTFIGVLE ERDKKGIKDV PGKEPSSIPA NRSSIELGTE IQTHLIGAPA TGKYITFAPA IDAFLKGHLF GDIFGRDNLD YQSRELATIS ALASIEGVNP QLQSHFNVGL NTGLTEAQLR SLITVLEANV GKKEAANASE TLGKVLSNRQ AEQRITIARS GSLPSSQGSA EYFSGSVKID TLFKAHEPAR TTGGLVTFQP GARTAWHSHP LGQTLIVTAG TGRIQQWGGP IEEIRQGDVV RIPPGVKHWH GAAPNTAMTH IAIAEQLNGN AVEWLEKVSD EQYNQLSSTR KRRNTYE
|
| |