Gene GSU1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1005 
Symbol 
ID2685628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1085392 
End bp1086369 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content65% 
IMG OID637125675 
Productdihydrouridine synthase family protein 
Protein accessionNP_952059 
Protein GI39996108 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.667167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGTT CTCTTACCAT AGGCTCCCTG ACCCTGGGAA ACAATCTCAT CCTCGCCCCC 
ATGGCCGGGA TAACCAACCT TCCCTTCCGC CTCCTGGCCC GCGAACAGGG GGCAGGTCTC
TGCTTCACCG AGATGGTGAG CGTGAACGGC CTTGTCCGGG AAGGTAAAAA GAGTTTCGAA
CTCCTGCGGA GCGTGCCGGG GGATCGCCCC CTCGGCATCC AGCTTTTCGG GGACGACCCG
GACGTCATGG GCCGGGTGGC GGCCACTGTG GACGGATACG GGGACCTCAT CGACATCAAC
ATGGGATGCC CCGTGAAGAA GGTGGTGGGG ACCGGCGCCG GGAGCGCCCT CATGCGTGAG
CCGGACAAGG TGCGGGCCAT TGTCAGGGCC GTCCGGCGAG CCACGCGGCT GCCGCTGACC
GTGAAGATCA GGAGCGGGTG GAGCTGCGAA GATGCCAACT TTATCCAGAT TGCCCGGATT
GCCGAGGAAG AGGGATGCAA TGCAGTTACG CTCCATCCCC GGAGCAGGGC ACAGATGTTC
GAAGGCACGG CCGACTGGAC GAAGCTCGCC GAACTGAAGC AGGCCGTCGC CATACCGGTC
ATCGGCAGCG GCGACCTCTT CAGCGCGGCC GACGTGGCCG CCATGCTCGA CCGGACCGGC
TGCGACGGGG TCATGATCGC CCGAGGTGCT CTGGGAAATC CGTGGATCTT CAGGCAGGCC
CTGGACCTGA TGGCCGGACG CGAGCCGGCG GCGGCCTCCC CGGCCGAACG GTTGGCGGTG
GCCCGGAGGC ATCTGGCCCT GTTCACGGAA ATGGCCGGCG AACGGGTAGC CGCGAGAGAG
ATGCGCAAGC ACCTGGGGTG GTACTCCCAC GGACTCCCCG GTGCGGCACA GTTCCGGAAG
GAAATCAACG AGATTGAGGG CAATGGCGCC CTGATGGAAG CAGTGAGCCG CTTTTTCACG
GCTGTGGGGG CGCCATGA
 
Protein sequence
MIRSLTIGSL TLGNNLILAP MAGITNLPFR LLAREQGAGL CFTEMVSVNG LVREGKKSFE 
LLRSVPGDRP LGIQLFGDDP DVMGRVAATV DGYGDLIDIN MGCPVKKVVG TGAGSALMRE
PDKVRAIVRA VRRATRLPLT VKIRSGWSCE DANFIQIARI AEEEGCNAVT LHPRSRAQMF
EGTADWTKLA ELKQAVAIPV IGSGDLFSAA DVAAMLDRTG CDGVMIARGA LGNPWIFRQA
LDLMAGREPA AASPAERLAV ARRHLALFTE MAGERVAARE MRKHLGWYSH GLPGAAQFRK
EINEIEGNGA LMEAVSRFFT AVGAP