Gene GSU1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1089 
Symbol 
ID2686929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1174351 
End bp1175457 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID637125758 
Productiron-sulfur cluster-binding protein 
Protein accessionNP_952142 
Protein GI39996191 
COG category[R] General function prediction only 
COG ID[COG2768] Uncharacterized Fe-S center protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0040129 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAGCA CCGTTTACTT CAGCGACATG CGGGCGGGAC ACAAGGAGAA CCTTTTCGCC 
AAGATCGGTA AACTCATGAT CCTGGCCGGT GCCAGGGAAC GGATCGCCAC GGGCGACCTG
GTGGCGGTAA AGGTCCACTT CGGAGAGCGG GGAAATCATG CGTTCATCCG CCCCATTTTT
ATCCGGCGCG TGGTGGACGA AATCAAAGGA TGCGGCGGAA AGCCCTTTCT CACCGACTCC
TCAACCCTCT ACCCCGGCGA GCGCAAGGAA GCGGTCTCCG CGCTGATCTG TGCCATCGAG
AACGGCTTCG ACTTTGCGGT TGCCGGCGCT CCCCTCGTCA TGTGCGACGG ACTCCGGGGC
AACTCGGCCA TTGTCGTTGA GGTGAACGGC GAACTGCTGA AGAAGGTCCC CATCGGCTCC
GCCATCGTCG AGGCCGACGC CCTGGTAGCC GTCTCCCACT TCAAGTGCCA TGAGTTGACC
GGCTTCGGCG GCGCCCTGAA GAACCTGGGC ATGGGCTGCT CAAGCCGCGA GGGGAAGCTG
ACCCAGCATT CCACCGTGGC GCCCAGGGTG GCCGAAAAAT ACTGCACCGG CTGCGGGCTC
TGCCTGAAGG CCTGTGCCCA CGACGCCATC GCCATCATCG AGGGGAAGGC CAAGATCGAC
CCGAAGGCGT GCGCCGGCTG CAGCCGCTGC ATCACCGTCT GCCCCACCAA GGCCATCACC
ATCCAGTGGA ACGAGGCCGC CGACCTGGTC ATGAAAAAGA TGGCCGAATT CGCCAAAGGG
GCCGTGACGG GCAAGCAGCA CAAGACCCTC TTCCTCAACT TCATCACCCA GGTCTCCCCG
GCCTGCGATT GCTACGGCCA CGCCGACGCC CCCATCGTGA ACGACATCGG CATCTGCGCC
TCCACCGACC CCGTTGCCCT GGACCAGGCC TGCGCCGACC TGGTCAATGA CGCCGTGGGC
AACCAGAATA CGGCGTTGGC CACCGGCCAT GAGCCGGGGG GTGACAAGTT CCGCGGGGTT
CACCCGGACA TCGATTGGGA GATTCAGCTG GAGCATGCCG AGAAGATCGG CATGGGGACG
CGCGAGTATG ATCTGGTGAG AATCTGA
 
Protein sequence
MPSTVYFSDM RAGHKENLFA KIGKLMILAG ARERIATGDL VAVKVHFGER GNHAFIRPIF 
IRRVVDEIKG CGGKPFLTDS STLYPGERKE AVSALICAIE NGFDFAVAGA PLVMCDGLRG
NSAIVVEVNG ELLKKVPIGS AIVEADALVA VSHFKCHELT GFGGALKNLG MGCSSREGKL
TQHSTVAPRV AEKYCTGCGL CLKACAHDAI AIIEGKAKID PKACAGCSRC ITVCPTKAIT
IQWNEAADLV MKKMAEFAKG AVTGKQHKTL FLNFITQVSP ACDCYGHADA PIVNDIGICA
STDPVALDQA CADLVNDAVG NQNTALATGH EPGGDKFRGV HPDIDWEIQL EHAEKIGMGT
REYDLVRI