Gene GSU2002 details


Gene Information

Locus tag: GSU2002
Symbol:
ID: 2688105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2194489
End bp: 2195592
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 61%
IMG OID: 637126693
Product: hypothetical protein
Protein accession: NP_953051
Protein GI: 39997100
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0492] Thioredoxin reductase
TIGRFAM ID:
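
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are inclusive, and that the stop codon is counted in the gene length but contributes no residue to the protein. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2194489
end_bp = 2195592
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which adds no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.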


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.158281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAGC AGATGTATGA CGTCCTCATC GTGGGGGGCG GTCCGGCAGG GATCGCCTGT 
GCCTACCTGT GTCACAGAAA CGATCTCTCC TATCTCCTCA TCGAACAGGG GAAGAGTGTT
TTTCAGGGAA TCACCAACAC CTACCCCGAA GGGAAGAACG TCTACCCGTC GCGGCCCAAG
GAGAGCCCCG AGCCGTTCCT GGTTGAAGAG CTCCGCCCCC CCGACAAGCC GGTGGCCGTG
GAAAAGTACA TCCAGTATGT GCAGCACTTC GTCCAGCACG AGAACCTGAA TATCCGGACC
GAGGTCCAGT TCGAGGACCT AAAGGACGCC CGCGATCACC TCATCGTCCA GACCTCTGTG
GGGAATTTCG CCGCCCGCAA GGTGGTCCTG GCCTTCGGCA GCAGCATTCC CCGGGAACTT
TCGGTCTACG GCGATGCCAA GATGGTGGCC AAGACCCTGG ATGACCCGAA GAAGTACGTG
GGGGCCCGGA CCCTGGTCAT CGGCGGCGGC AACACGGCGG CGGACGTAAT CATTTCCATC
CTCAGGGCCA AACGCGAGGC CGGGGACACC CAGTCGGTCT ACTGGGCCCA TGTGGCGGAA
AAATTCGACG TGAACAAGGA GACCGCCCAG CGCCTGGGGG AGGAGATCCT CCTGGGTGGC
AATATCAGGC TGCTTCCCGG CGCCATCCCC CGCATCGGCG AGGTTGACCA GGAGGGGGTC
GACCGGCTCG TAATCCGGGT GAACGAGTTC ACCCAGCCCG ACGGCATCGA GATCTACCAT
GCCATGAGCT TCCCCATGAA GAACGTCATT GCCTGCATCG GCTCCCAGGG ACCGCTTCCT
ATCTTCGACA AGATCGGGGT CCAGACCATT GCCTGCGCCG AAGGAGTCTG CACCGTGGCC
AAAGAGGGGG ACCGGCTCAT CCTGCTCAAC GCCGAGTTCG AGTCGACCCG CAAGGGGGTC
TACGTCATCG GCGGCGCCAT CTCACCCTCG TTTATGAAGA TCTGCGGCGG CAGCATCCAG
GAGGAGAAGC ATCCCAACCT GATCTACACC GCAATCAACG ATGCCTTCCA CGTAGTGGAA
GCCGTCAAGA GGAAGCTTGC CTGA
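
A GC content figure such as the one listed under Gene Information is commonly computed as the fraction of G and C bases in the sequence. The sketch below shows one way to do this for a sequence written in space-separated blocks of ten, as above. The function name `gc_content` is hypothetical, and the example uses only the first 20 bases of the gene, so its result is not expected to equal the value reported for the full gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)

# First 20 bases of the gene sequence, copied from the block above.
example = "ATGGAACAGC AGATGTATGA"
print(f"GC fraction of the first 20 bases: {gc_content(example):.2f}")
```

To check the full gene, paste all sequence lines into one string and pass it to the same function.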
 
Protein sequence
MEQQMYDVLI VGGGPAGIAC AYLCHRNDLS YLLIEQGKSV FQGITNTYPE GKNVYPSRPK 
ESPEPFLVEE LRPPDKPVAV EKYIQYVQHF VQHENLNIRT EVQFEDLKDA RDHLIVQTSV
GNFAARKVVL AFGSSIPREL SVYGDAKMVA KTLDDPKKYV GARTLVIGGG NTAADVIISI
LRAKREAGDT QSVYWAHVAE KFDVNKETAQ RLGEEILLGG NIRLLPGAIP RIGEVDQEGV
DRLVIRVNEF TQPDGIEIYH AMSFPMKNVI ACIGSQGPLP IFDKIGVQTI ACAEGVCTVA
KEGDRLILLN AEFESTRKGV YVIGGAISPS FMKICGGSIQ EEKHPNLIYT AINDAFHVVE
AVKRKLA
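
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene are translated here.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and the final line of the gene sequence above.
first_bases = "ATGGAACAGC AGATGTATGA CGTCCTCATC"
last_bases = "GCCGTCAAGA GGAAGCTTGC CTGA"

print(translate(first_bases))  # MEQQMYDVLI
print(translate(last_bases))   # AVKRKLA
```

The two fragments reproduce the first ten and last seven residues of the protein sequence. The final codon, TGA, is a stop and yields no residue.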