Gene Sbal195_4218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4218 
Symbol 
ID5756049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4992531 
End bp4993421 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content48% 
IMG OID641290574 
Productformate dehydrogenase subunit FdhD 
Protein accessionYP_001556636 
Protein GI160877320 
COG category[C] Energy production and conversion 
COG ID[COG1526] Uncharacterized protein required for formate dehydrogenase activity 
TIGRFAM ID[TIGR00129] formate dehydrogenase family accessory protein FdhD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.072328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.431807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGG AAAACAGCAT GGGCACAGAT AAAGAAGTAG CGGCAGCTAA CCCCACTCAA 
AAGCCTCATC ATCAGTGTTC TTTTGTGAGA ACCCAAGCGG AAGTACCGTT AACCATTGCC
GTAAAAGCAG TGAATGAAGC TGGGGAAGTG CTGGATAAAT TTGTCGCCTG CGAACGTCCA
CTTACCGTTT ATTTAAACTG GCGTCCGATA GTGACGCTGA TGACACTAGG GGCAAAACCT
GAGTCTCTGG CGTTAGGTTA TCTTAAGAAC CAAGGCTTTA TTTCGGATGT GAGCCTGTTG
GACTCTGTTA TCGTCGATTG GGATGTGAGT TCTGCGGCTG TGGTTACCCG CGAACAAACC
GCCGATCTCG ACGAGAAACT CTCCGAAAAA ACCGTGACGT CAGGCTGCGG TCAAGGCACG
GTTTATGGCA GTTTCATGCA GGATTTAGAT AACATCAATT TACCCACTCC GAGTTTGAAG
CAAAGCACGC TGTATAGCTT ATTGAAAAAT ATCAACGAAT ATAACGAAAC CTATAAGAAT
GCTGGCGCTG TGCATGGTTG TGGTTTGTGC GAAGACGATA GGATTATGGC CTTCGTTGAA
GATGTCGGCC GCCATAACGC CGTTGATACC TTGGCAGGGG ATATGTGGCT GACGCAGGAT
CGTGGCGATA ACAAGATATT CTATACCACA GGCCGCTTAA CCTCTGAAAT GGTGATTAAA
GTCGCCAAGA TGGGGATCCC CATTTTGCTA TCACGTAGCG GCGTGACTCA GATGGGCTTA
GCACTGGCGC AACAGTTAGG CATTACTATT ATCGCCCGCG CTAAAGGTCG ACACTTTTTG
GTGTATCACG GCAGTGAAAA TCTGCAATTT GATGCCAATA CGGCTACTTA G
 
Protein sequence
MISENSMGTD KEVAAANPTQ KPHHQCSFVR TQAEVPLTIA VKAVNEAGEV LDKFVACERP 
LTVYLNWRPI VTLMTLGAKP ESLALGYLKN QGFISDVSLL DSVIVDWDVS SAAVVTREQT
ADLDEKLSEK TVTSGCGQGT VYGSFMQDLD NINLPTPSLK QSTLYSLLKN INEYNETYKN
AGAVHGCGLC EDDRIMAFVE DVGRHNAVDT LAGDMWLTQD RGDNKIFYTT GRLTSEMVIK
VAKMGIPILL SRSGVTQMGL ALAQQLGITI IARAKGRHFL VYHGSENLQF DANTAT