Gene Ssed_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_1900 
Symbol 
ID5611131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2284025 
End bp2285140 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content51% 
IMG OID640932786 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_001473639 
Protein GI157375039 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00044285 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGCAC TGAAATCACT TTACCAAGGG TTCAGAGACC CTAAAACCAT TGCTAAACTG 
GCGGAGATGA TCGCCATAGA AGCGGCAAAA TGTTCAGAAC CCATTAACAT CATGGAAGTG
TGCGGTGGAC ATACTCACAC CATCATGAAA TATGGTTTGA ATCAGCTCTT GCCTGAGAAC
ATTAAGTTCA TCCATGGGCC CGGATGCCCG GTTTGTATCA TGCCTAAAGA GCGTATCGAC
CATGCCGCGA CACTTGCCAG TCTACCTAAT GTCATTCTCG TCACATTAGG CGACATGATC
AGGGTGCCGG GTTCGAAAGG CAGCCTCGCC GAGTTCAGGT CGAAAGGGTG CGACATTCGT
CCGATCTACG ATCCACTCGA TACTCTGGCT ATCGCCATCG ATAACCCGGA TAAAACCGTT
ATCTTCTTTG CTATCGGCTT TGAAACATCT ACCCCTATGA CAGCGGTTCT TCTCGAGCAG
GCAGAAAAGA GAAATATCGA TAACCTGCTG TTTCATATCA ACCATGTGTT AGTGCCACCG
GCAATGGATG CGGTCATGGC GGATCCCAAG GCGACGGTTA ACGCCTTTAT CGGTCCGGCT
CACGTCAGCG TGATAAGTGG CGCTAAGGTC TATCGTCCTG CCGTTGATAA TTACCATATG
CCTGTTGTGG TATCGGGTTT TGAACCTGTC GACGTTATGG AGTCCATCTT AAGAATTACC
AAGCAGAAGG CGCAAGGTGT GGCCGAGCTC GATGTGCAAT ATAGCCGAGC CGTGACCGAA
GAGGGAAACC TTGCCGCACA GGAGAAGAAC GAGCACTTCT TCGAAATAAG AGAAGATTTC
CGCTGGCGTG GACTCGGCCC GATCCCAAAT TCCGCCCTGA AACTCAGCTC ACAATATGCC
CACCGGGATG CTGAACTCAT CTACGCCGAC AGGTTGCCGG TAAAAGAGAT CGACGACCAT
AAAGCCTGTC AGTGCGGTGA TATCCTGCGC GGACTCGCTA ACCCGAAAGA TTGTAAAGTC
TTCGGGCGAG GTTGCAGTCC TGAATCACCA CTGGGCAGTT GTATGGTCAG CTCTGAAGGT
GCCTGTAACG CTTACTACCG CTACAACGGA GTTTAA
 
Protein sequence
MLALKSLYQG FRDPKTIAKL AEMIAIEAAK CSEPINIMEV CGGHTHTIMK YGLNQLLPEN 
IKFIHGPGCP VCIMPKERID HAATLASLPN VILVTLGDMI RVPGSKGSLA EFRSKGCDIR
PIYDPLDTLA IAIDNPDKTV IFFAIGFETS TPMTAVLLEQ AEKRNIDNLL FHINHVLVPP
AMDAVMADPK ATVNAFIGPA HVSVISGAKV YRPAVDNYHM PVVVSGFEPV DVMESILRIT
KQKAQGVAEL DVQYSRAVTE EGNLAAQEKN EHFFEIREDF RWRGLGPIPN SALKLSSQYA
HRDAELIYAD RLPVKEIDDH KACQCGDILR GLANPKDCKV FGRGCSPESP LGSCMVSSEG
ACNAYYRYNG V