Gene Gura_1985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1985 
Symbol 
ID5166532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2298117 
End bp2299790 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content51% 
IMG OID640549479 
ProductSel1 domain-containing protein 
Protein accessionYP_001230748 
Protein GI148264042 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000285463 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACAAG TGATATATGT ATGTCTGCAT GTGGGCCAAA CACTAAGCGT GCAGAGAGTG 
CGGCACGGAA GGAGGGAAAA GCTGGATTTG ATGGTTAAAC TGACTGCAAT CGTGTTGATC
TGCCTGGCAA CCACTTGTTT ATTTGCCGAT ACAAAGGCAG ACCCTCGTGC TGTCAGGAGC
GCAAATATTG CGGAGATCAG GAAACTGGCA ATTGAGGGTC ATGTTGATGC CCAGTTCTAT
ACGGGGTTTA TGTATGAAAA AGGGCAGGGC GTACTCCAGG ACTATGCCGA GGCGGTGAAA
TGGTATCTGA AAGCGGCCGA GCAGGGGCAT GCCGGTGCGC AAATCAATGT CGGCATCATG
TATTTCAAGG GGCAGGGGGT ATTACCGGAT TATGCCGAGG CGGCGAAATG GTATCGAAAA
GCAGCTCTTC AGGGGAATGC AAACGCTCAA TTCAATCTCG GTCTGATGTG CAACAAAGGT
CAAGGGGTAT CCCGGGACTA TGTCGAGGCG GCGAAATGGT ATCTGAAAGC AGCTGAACAG
GGGAATAGTG GTGCTCAATT CAATCTCGGT CTGATGTACT ACAAAGGGGA CGGGGTTGCA
CGGAACTTTG CCGAAGCCTT CACATGGTAC CGGAAGGCGG CCGAACAGGG GAATGCGGGG
GCCCAGTTCA GTCTGGGTTT AATGTATTAT AAAGGTCAAG GAGTGCCGAA GAATTTTGCC
GAGGCCGCCG CATGGTATCG TAAGTCTGCT GAGCAGGGGC ATGTAGGCGC CCAGTTTAAT
CTGGGGTACA TGTACGAAAT GGAGCAAGGT GCAGTCGGAG GGAATGCCGA AGCGGCAAAA
TGGTACCGGA AGGCTGCTGA GCAAGGACAC GCAGGCGCCC AGTCTAATCT GGGGTACATT
TATGATATCG GAGAAGGGGT GCCCCAGGAT CATGCCGAAG CGGCCAAATG GTACAGGAAG
GCAGCCGAAC AGGGAAATGC CGCTGCGCAA TTAAACCTTG GGATCATGTA TGATAATGGT
CATGGTATCT CCCAGGACAA TGCAGAAGCG GTCAAATGGT ATCGCAAGGC TGCGGAACAG
GGGGATATGA CCGCCCAATA CAATATGGGA GTCAAGTATG CCAATGGAAT CGGCGTGCCG
CGCAACAATG CCGAAGCTGT CGAATGGTAC CGGAAAGCCG CTGACCAGGG GCATGAAATT
TCACAGGTCA ATCTTGGCCA TTTATATGAA AATTCAGACG GCGTACCCCA GGACTATGCG
CAAGCACTCA AATGGTATGG TAAGGCTGCC GAACAGGAAA ATAGCGATGC CCAGTTCAGC
TTGGGGTTAA TGTATGCCAA AGGCCAGGGG ACGCCACAGA ACTACGCCGA AGCGGCCAAA
TGGTATAGAC GGGCGGCTGA CCTGGGGAAT GAGATTGCGT ATTATAATCT GGCAATTCTC
TACTATAAAG GTCTGGGTGT GGATCGGGAC TATGCCGAAA CAGTAAGATT GCTTAAGGAG
GTCGCCGATC AGGAAGATGC AAATGTTCAT TTCAGCCTGG GATATATGTA TTATAAGGGG
CAAGGGGTAA TCGAGGACCA TGCCGAAGCT TTGAAATGGT TCAGAAAAGC CGGTGATGAG
GGCCTTAAAG AGGCCATGAA CTATGTAAAT TCAATCGAAA AGAAGGTGAA ATGA
 
Protein sequence
MGQVIYVCLH VGQTLSVQRV RHGRREKLDL MVKLTAIVLI CLATTCLFAD TKADPRAVRS 
ANIAEIRKLA IEGHVDAQFY TGFMYEKGQG VLQDYAEAVK WYLKAAEQGH AGAQINVGIM
YFKGQGVLPD YAEAAKWYRK AALQGNANAQ FNLGLMCNKG QGVSRDYVEA AKWYLKAAEQ
GNSGAQFNLG LMYYKGDGVA RNFAEAFTWY RKAAEQGNAG AQFSLGLMYY KGQGVPKNFA
EAAAWYRKSA EQGHVGAQFN LGYMYEMEQG AVGGNAEAAK WYRKAAEQGH AGAQSNLGYI
YDIGEGVPQD HAEAAKWYRK AAEQGNAAAQ LNLGIMYDNG HGISQDNAEA VKWYRKAAEQ
GDMTAQYNMG VKYANGIGVP RNNAEAVEWY RKAADQGHEI SQVNLGHLYE NSDGVPQDYA
QALKWYGKAA EQENSDAQFS LGLMYAKGQG TPQNYAEAAK WYRRAADLGN EIAYYNLAIL
YYKGLGVDRD YAETVRLLKE VADQEDANVH FSLGYMYYKG QGVIEDHAEA LKWFRKAGDE
GLKEAMNYVN SIEKKVK