Gene Sama_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1678 
Symbol 
ID4603929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2051729 
End bp2053432 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content56% 
IMG OID639781041 
Producthydrogenase large chain 
Protein accessionYP_927554 
Protein GI119774814 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAC GTGTTGTTAT CGATCCCATC ACCCGTATCG AGGGCCATTT GCGCGTCGAG 
GTCGAGGTAG ATGAGAACAA TGTGATTCAG AACGCCTGGT CCTCTTCCAC CCTGTGGCGC
GGCATTGAGG TTATCCTTAA GGGCCGTACG CCCATGGACG TGGGCCTGAT TGTGCAGCGT
ATTTGCGGTG TGTGTACCTA TTCCCACTAT CGCTGTGGTA CAGAAGCGGT GGAAAACGCC
CTCGGGGTAA AAATTCCACT CAACGCCAAG TACCTGCGTT CACTGATGCA AACGTCGCTC
TATATGCACG ACCATATAGT GCACTTCTAT CACCTCCATG GTCTCGATTG GGTGGATGTG
GTGTCTGCAC TCAGTGCCGA CCCAGCCAAG GCCGCTCAGG TGGCGCTCAA GTACACCGAT
AAGCCTATCG CGGCCGGTGA AGGTGAGCTT CGCGCCGTGC AGGAGCGGGT GAAAGGCTTT
GTGGAAAAAG GCCAGCTTGG CCCCTTCGCC AATGCCTACT GGGGCAATGG CACCTATAAG
TTTACCCCCG AGCAAAACCT GATTGCCCTG TCCCACTACC TCAAAGCGCT GGAAGTGCAG
CGCGTGGCGG CTGAAATGCT CGCCATCTTT GGTGGTAAGC AGCCTCACCC ACAATCTCTG
GTGGTGGGCG GCGTGACCTC AGTACGTGAC ATGTTAAGCC CTGCCCGTCT GCAGGAATGG
AAGCAAAAAC ATGCCTATGT GACCGACTTT ATCAAACGTG CTTATCAGGC CGATATTGTC
ATGGCTGCTG AAGCCTTCGG TACTGAGCCT TCAGTGCTTG GCGGCGTGAA CGTGAAAAAC
TTCCTGGCCA CAGATGACTT CGTCCTAGCC GATGGCGAGT ACATGTTTAA CGGCGGCGTG
ATAATGAACG GTGACCTGGC CGGTGTCAGC GACATCAACC CGGATTTGAT TGCCGAAGAC
GTCACCCACG CCTGGTACAG TGCCGACAAC GCCCAGCACC CCTATGATGG CACTACAGTG
CCTAACTATA CGGGCTTTGT GGAGCGTGAC ACCGTGTACG GCAAACTGCC TACCCTCGAT
GGTGACGGCA AGTACTCCTG GGTGAAGTCG CCACGCTATA ACGGCGAGCC TGTGGAAGTG
GGTCCACTCT CCAGCCTGCT GGTCAGTTAC GCCCGTGGCA ACAAGGTGGT CGTGGATGCC
GTTAACGGTC TTCTCGCCCG CACTGGCCTG CCTGTGGAAG CGTTGTTCAC CACTCTTGGC
CGCACTGCGG CGCGCATGCT GCAAACCGTG ATAGTGGCCG ATGAAGGTCT GCGTACCTTC
GATGCTCTGC TGACCAACAT TCAGTCAGAC GAAGCCACCT ATGTGAAGCC TGAAATCGAT
GCCAACAAAG AGTACGTTGG TCACGCCATG ATTGAAGCGC CTCGCGGTAT GCTGAGTCAC
TGGATCCGCA TCAAGGGCGG TAAAGTCGAA AACTATCAGG CTGTGGTGCC CACCACCTGG
AATGCGGGTC CCAAGGATGC CAACGGCAAG ATAGGCCCTT ATGAGGCATC GCTGATTGGC
TTAAAGCTTG AGGACCCCAC CAAACCGCTG GAGGTCATCC GCATCATCCA CTCCTTCGAC
CCCTGCATGG CTTGCTCGGT ACACGTAATG GACTTCAAGG GTGCAAGCCT GTCAGAGTTC
AAGGTGTCAC CAAACGGGCT CTAA
 
Protein sequence
MSQRVVIDPI TRIEGHLRVE VEVDENNVIQ NAWSSSTLWR GIEVILKGRT PMDVGLIVQR 
ICGVCTYSHY RCGTEAVENA LGVKIPLNAK YLRSLMQTSL YMHDHIVHFY HLHGLDWVDV
VSALSADPAK AAQVALKYTD KPIAAGEGEL RAVQERVKGF VEKGQLGPFA NAYWGNGTYK
FTPEQNLIAL SHYLKALEVQ RVAAEMLAIF GGKQPHPQSL VVGGVTSVRD MLSPARLQEW
KQKHAYVTDF IKRAYQADIV MAAEAFGTEP SVLGGVNVKN FLATDDFVLA DGEYMFNGGV
IMNGDLAGVS DINPDLIAED VTHAWYSADN AQHPYDGTTV PNYTGFVERD TVYGKLPTLD
GDGKYSWVKS PRYNGEPVEV GPLSSLLVSY ARGNKVVVDA VNGLLARTGL PVEALFTTLG
RTAARMLQTV IVADEGLRTF DALLTNIQSD EATYVKPEID ANKEYVGHAM IEAPRGMLSH
WIRIKGGKVE NYQAVVPTTW NAGPKDANGK IGPYEASLIG LKLEDPTKPL EVIRIIHSFD
PCMACSVHVM DFKGASLSEF KVSPNGL