Gene GSU0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0123 
Symbol 
ID2687975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp137241 
End bp138326 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content66% 
IMG OID637124790 
Productnickel-dependent hydrogenase, small subunit 
Protein accessionNP_951185 
Protein GI39995234 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.799752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACAGC ATTGCAAGAG CTCGGGAACA TGGGAAGAAC AGGGGATTTC CCGGCGTGAC 
TTCCTGAAGT TCTGCACCGC AATGTCGGCG GTGCTGGCCC TGCCGGTTAC CCTGGTGCCT
CGAATCGCCG AGGCCCTGGA GGATGACAAG CGCCCCTCGG TCATCTGGCT CGAATTCCAG
AGCTGCACCG GCGATACCGA GGCACTCCTG AGAGCCGCCA ATCCCACCGT GGCCGACATC
GTCCTGGACG TCCTCTCCGT GGACTACGCC GAGACCATCA TGGCAGCGGC GGGCTTCCAG
GCCGAAGCTG CCAGGCACGC AACCATGAAA GCGCGATCCG GCACGTACAT CGCCGTGGTG
GAAGGAGCGA TTCCGACCGG AGCAAACGGC GCCTACTGCT GCATCGGCGG CCGGTCGGCC
CTGGACATCA CCCGCGAAGT CTGCGGCGGC GCCATGGCAA CTATCACCGT CGGCACCTGC
GCCTCCTACG GGGGGATACC GGCGGCGGCG CCCAACCCGA CCGGGGCGGT CGGCGTCAAG
GATGCGGTGC CGGGGGCCAC GGTCATCAAC CTCCCCGGCT GCCCCGTCAA CACCGACAAC
CTGGTGGCTA CCGTGGTCCA CCTGCTCACC TTCGGCTCCC CGCCGGCCAC CGACGGCAAG
GGCCGCCCTC TCTTCGCCTA CGGCAAGCGG ATTCACGATA ACTGCGAGCG ACGGCCCCAC
TTTGACGCGG GCCAGTATGT GGAACAATGG GGCGATCAGG CACACCGGGC GGGCCACTGC
CTCTACAAAA TGGGCTGCAA GGGCCCCGAA ACCTTCCACA ACTGCCCCTC CCAGCGCTAC
AACGAAAAGA CAGCCTGGCC GGTTTCGGCG GGCCATGGCT GCGCCGGCTG CTCCGAGCCC
CACTTCTGGG ATTCCATGAC ACCGCTCTAC AAGCGGCTCC CCAATGTGCC CGGCTTCGGC
ATAGAGGCTA CCGCCGACCA GATCGGCCTG GGAATCGCAG CGGCCACAGC GGCCGCATTC
GGCATCCACG GCGTGGTGAG CGCCCTGCGC AAGGGGAACG ATTCCGACGA TGAGAAGGAG
GGGTAG
 
Protein sequence
MKQHCKSSGT WEEQGISRRD FLKFCTAMSA VLALPVTLVP RIAEALEDDK RPSVIWLEFQ 
SCTGDTEALL RAANPTVADI VLDVLSVDYA ETIMAAAGFQ AEAARHATMK ARSGTYIAVV
EGAIPTGANG AYCCIGGRSA LDITREVCGG AMATITVGTC ASYGGIPAAA PNPTGAVGVK
DAVPGATVIN LPGCPVNTDN LVATVVHLLT FGSPPATDGK GRPLFAYGKR IHDNCERRPH
FDAGQYVEQW GDQAHRAGHC LYKMGCKGPE TFHNCPSQRY NEKTAWPVSA GHGCAGCSEP
HFWDSMTPLY KRLPNVPGFG IEATADQIGL GIAAATAAAF GIHGVVSALR KGNDSDDEKE
G