Gene SO_3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3920 
SymbolhydA 
ID1171559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp4067806 
End bp4069038 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content49% 
IMG OID637345680 
Productperiplasmic Fe hydrogenase, large subunit 
Protein accessionNP_719451 
Protein GI24375408 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACGA CAACTTATCA ACCAGGAGAA ATCCAAGGGC TGATCAAGAT TAATGCATCC 
AAATGCAAAG GATGTGATGC CTGTAAACAA TTCTGCCCAA CCCATGCCAT TAATGGCGCT
TCGGGTGCAG TACACTCTAT CGATGAAGAT AAATGCTTAA GCTGCGGACA GTGTTTAATT
AACTGTCCAT TTAGCGCTAT TGAGGAAACC CACAGCGCAC TTGAAACCGT GATTAAAAAG
CTCGCTGATA AAAATACCAC CGTGGTCGGG ATTATCGCGC CTGCGGTACG GGTGGCGATT
GGTGAAGAAT TTGGCTTAGG TACAGGTGAG CTAGTAACAG GCAAACTCTA CGGTGCCATG
AATCAAGCTG GCTTTAAAAT TTTCGACTGT AACTTCGCCG CCGATTTGAC CATTATGGAA
GAAGGCAGTG AGTTTATTCA TCGCCTGCAC GCCAATGTAA AAGGTGAAGC TAACGCAGGC
CCATTGCCGC AATTTACCTC CTGCTGCCCA GGCTGGGTAC GCTACCTCGA AACCCGCTAC
CCTGCACTTT TACCTAACCT ATCGACCGCC AAATCACCTC AGCAAATGGC AGGGACTGTC
GCCAAAACCT ACGGCGCCAA GGTATATCAA ATGCAGCCAG AGAATATTTT CACTGTCTCT
GTAATGCCTT GCACCTCGAA AAAGCTCGAA GCCTCCCGTC CCGAATTTAA CTCGGCTTGG
CAATATCATC AGGAACACGG CGCAAACTCG CCCTCCTACC AAGATATTGA TGCCGTGCTC
ACCACAAGGG AAATGGCTCA GTTACTCAAA CTGCTCGATA TCGATCTCGC GAATACCGCG
GAATATCAAG GCGATAGTTT GTTCTCTGAA TACACTGGCG CGGGCACAAT TTTTGGAACA
ACCGGCGGGG TGATGGAAGC GGCGCTGCGT ACCGCCCATA AAGTACTGAC TGGAACTGAA
ATGGCTAAGC TGGAATTTGA ACCCGTACGC GGGCTAAAAG GCGTGAAATC AGCCTCTGTC
AGCCTGTTTG ATACAGAGCT TAACCAAGAT GTGACCGTCA ATGTCGCCGT AGTGCACGAC
ATGGGCAACA ACATTGAGCC CGTACTGCGC GATGTGATGG CTGGCACCTC TCCTTATCAC
TTTATTGAGG TGATGAACTG CGCTGGCGGT TGCGTCAACG GCGGAGGCCA ACCTATTGAA
GGTAAAGGCT CTTCATGGCT GGGTAACATT TAA
 
Protein sequence
MTTTTYQPGE IQGLIKINAS KCKGCDACKQ FCPTHAINGA SGAVHSIDED KCLSCGQCLI 
NCPFSAIEET HSALETVIKK LADKNTTVVG IIAPAVRVAI GEEFGLGTGE LVTGKLYGAM
NQAGFKIFDC NFAADLTIME EGSEFIHRLH ANVKGEANAG PLPQFTSCCP GWVRYLETRY
PALLPNLSTA KSPQQMAGTV AKTYGAKVYQ MQPENIFTVS VMPCTSKKLE ASRPEFNSAW
QYHQEHGANS PSYQDIDAVL TTREMAQLLK LLDIDLANTA EYQGDSLFSE YTGAGTIFGT
TGGVMEAALR TAHKVLTGTE MAKLEFEPVR GLKGVKSASV SLFDTELNQD VTVNVAVVHD
MGNNIEPVLR DVMAGTSPYH FIEVMNCAGG CVNGGGQPIE GKGSSWLGNI