Gene Gura_4370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4370 
Symbol 
ID5166956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp5062035 
End bp5063198 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content53% 
IMG OID640551852 
Productcupin 2 domain-containing protein 
Protein accessionYP_001233086 
Protein GI148266380 
COG category[S] Function unknown 
COG ID[COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit
[COG1917] Uncharacterized conserved protein, contains double-stranded beta-helix domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA AATTCACCAT ACTTCTGGCA AGTACACTTG CCGCAATATT CAGCATCGTA 
ACCTTTTCGG AGGCTCAGAC CGTGACAACA GGAGGTTTGA ACGCCAAGCA GGAAAACATT
GTCACCATTG CGGCCTTTAC TGCCAGCGGC GACTTGCAAA AACTGAAAAC GGCCTTAAAC
GATGGGCTGG ATGCCGGTTT GACCATCAAC GAAATCAAGG AAATCCTTGT GCAGATGTAC
GCCTATGCCG GGTTCCCCCG CAGTCTGAAC GGGATCAATA CCTTCATCGG CGTCCTGGAA
GAACGGGACA AGAAAGGGAT CAAGGATGTT CCCGGTAAGG AACCAAGCTC CATACCTGCC
AACAGGAGCA GCATCGAACT TGGGACCGAA ATCCAGACAC ACCTGATAGG AGCGCCGGCC
ACCGGGAAGT ATATTACCTT TGCCCCGGCC ATTGATGCGT TCTTGAAGGG ACACCTATTC
GGGGACATCT TCGGGCGTGA CAACCTGGAT TACCAGAGCA GGGAGCTGGC AACCATTTCA
GCCTTGGCGA GTATCGAGGG GGTCAATCCT CAGTTGCAGT CACATTTTAA CGTCGGACTG
AATACCGGAC TGACCGAGGC GCAACTGCGG AGCCTGATAA CCGTTCTCGA AGCAAACGTT
GGTAAAAAGG AAGCCGCAAA TGCTAGTGAA ACATTGGGTA AAGTTCTGAG TAACAGGCAG
GCGGAGCAGA GAATCACTAT CGCTCGGAGC GGCTCCCTGC CTTCAAGCCA AGGTTCAGCC
GAATACTTTT CAGGTTCCGT AAAAATCGAC ACGCTATTCA AAGCGCACGA ACCGGCACGT
ACGACAGGCG GACTTGTCAC GTTCCAACCG GGTGCCCGGA CGGCGTGGCA CTCCCATCCG
CTCGGCCAGA CTTTAATCGT GACAGCGGGC ACCGGCCGAA TACAACAGTG GGGTGGCCCG
ATTGAGGAGA TCAGGCAGGG TGATGTCGTA CGGATTCCGC CCGGCGTAAA ACATTGGCAC
GGAGCCGCGC CAAACACAGC CATGACTCAT ATCGCCATAG CAGAACAGCT TAATGGCAAT
GCCGTCGAAT GGCTGGAAAA GGTTAGTGAC GAGCAGTACA ACCAACTGTC GTCTACACGA
AAAAGGAGAA ACACATATGA GTAA
 
Protein sequence
MNKKFTILLA STLAAIFSIV TFSEAQTVTT GGLNAKQENI VTIAAFTASG DLQKLKTALN 
DGLDAGLTIN EIKEILVQMY AYAGFPRSLN GINTFIGVLE ERDKKGIKDV PGKEPSSIPA
NRSSIELGTE IQTHLIGAPA TGKYITFAPA IDAFLKGHLF GDIFGRDNLD YQSRELATIS
ALASIEGVNP QLQSHFNVGL NTGLTEAQLR SLITVLEANV GKKEAANASE TLGKVLSNRQ
AEQRITIARS GSLPSSQGSA EYFSGSVKID TLFKAHEPAR TTGGLVTFQP GARTAWHSHP
LGQTLIVTAG TGRIQQWGGP IEEIRQGDVV RIPPGVKHWH GAAPNTAMTH IAIAEQLNGN
AVEWLEKVSD EQYNQLSSTR KRRNTYE