Gene Shewmr4_1822 details


Gene Information

Locus tag: Shewmr4_1822
Symbol:
ID: 4252396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2162698
End bp: 2163834
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 54%
IMG OID: 638118433
Product: hydrogenase (NiFe) small subunit HydA
Protein accession: YP_733953
Protein GI: 113970160
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
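
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one untranslated stop codon. Neither convention is stated in the record, although both are consistent with the values shown. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2162698
end_bp = 2163834
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: start and end coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

All three checks hold for the values in this record.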


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.249477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACAC ATGCAGCCCT CTATGAACAG GGAAAGGCGC GCTTAGATGC ACTGCGCCAA 
TTCGCTCCCC GTCAACAGCA AACATTAACC GAAAAATTGC AACAACACGG CATTACCCGC
CGTGACTTTA TGAAGTGGAG CGCCATGGTG ACGGGTATGC TCGCCCTACC GCTGCCCTTT
AGTAACTTAG TCGCCGAAGC TGCCGAACTC GCCGACCGTG TACCCTTAAT CTGGTTACAC
ATGGCCGAAT GTACTGGCTG CTCCGAATCC TTAGTGCGCG CCGATACGCC CAATCTCGAT
TCGCTGATCT TCGATCATAT CTCGTTGGAA TATCACGAAA CCCTCATGGC CGCTGCTGGC
TGGCAAGCAG AGGAAAATCT CGAGCACGCC CTAGAGACCT ACAAGGGTCG TTACCTGCTC
GCCGTTGAAG GCGCGATACC GACCGCCAAT AACGGCAGCT TCTTAACCGT TGGCTGTAAA
GGCCATACTG GCTTAGAAAT TATCAAACAT GCCGCCGAAG GTGCTGCGGC GATTATTTCT
GTCGGCACCT GCGCTTCCTT CGGTGGTGTG CAAGCCGCCT ACCCCAACCC GACTGGGGCA
AAAGGGGTAC ACGAAGTTGT GAGCAAGCCT GTGATCAACT TAGGTGGCTG TCCACCGAGT
GAGAAAAACA TCGTCGGCAC CCTGATGTAT TTCATCATGT TCGGCAAATT ACCTGCGCTG
GATATGTTCA ACCGGCCGAA ATGGGCTTAT GGCGCACGGG TACACGATAA CTGTGAACGC
CGCGGCCGTT TCGATGCCGG TGAGTTCGTT GAAGAGTTTG GCGATCACGG TGCGAAGGAA
GGTTACTGCC TCTACAAAGT GGGTTGTAAA GGACCTTATA CCTATAACAA CTGCCCCACA
GAGCGCTTTA ACCACCATAC CAGCTGGCCA GTGTTAGCGG GCCACGGTTG TATGGGCTGC
TCAGAACCTA ACTTCTGGGA TGATATGGCC GACTTTGAAA AACCCCTTGG CCGTCAACTA
CTCCATGGAT TGGATGCGAC CGCAGACACA GTCGGTGCGG TGATTTTAAG CGCAACCGTC
GTCGGCATTG GAGCCCACGC CGTTGCCAGT ATTTTTGCCA AGCCGCTGGA GGAATAA
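
The GC content reported in the table can be recomputed from a sequence laid out like the one above. The record does not say how its figure was calculated or rounded, so the following is a minimal sketch using the usual definition (the fraction of G and C bases). The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used to group bases into
    blocks of ten) is ignored, and letter case does not matter.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first ten bases of the gene sequence.
print(gc_percent("ATGGACACAC"))
```

Passing the full gene sequence as one string, with its spaces and line breaks left in, gives a value to compare against the reported GC content.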
 
Protein sequence
MDTHAALYEQ GKARLDALRQ FAPRQQQTLT EKLQQHGITR RDFMKWSAMV TGMLALPLPF 
SNLVAEAAEL ADRVPLIWLH MAECTGCSES LVRADTPNLD SLIFDHISLE YHETLMAAAG
WQAEENLEHA LETYKGRYLL AVEGAIPTAN NGSFLTVGCK GHTGLEIIKH AAEGAAAIIS
VGTCASFGGV QAAYPNPTGA KGVHEVVSKP VINLGGCPPS EKNIVGTLMY FIMFGKLPAL
DMFNRPKWAY GARVHDNCER RGRFDAGEFV EEFGDHGAKE GYCLYKVGCK GPYTYNNCPT
ERFNHHTSWP VLAGHGCMGC SEPNFWDDMA DFEKPLGRQL LHGLDATADT VGAVILSATV
VGIGAHAVAS IFAKPLEE
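
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The Python sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). It relies on table 11 assigning the same amino acids to codons as the standard code, with the two tables differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical, and unrecognised codons are reported as `X` by assumption.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, each over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between ten-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the first three ten-base blocks of the gene sequence.
print(translate("ATGGACACAC ATGCAGCCCT CTATGAACAG"))  # MDTHAALYEQ
```

The printed result matches the first ten residues of the protein sequence above.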