Gene Shew185_4100 details

Gene Information

Locus tag: Shew185_4100
Symbol:
ID: 5372935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS185
Kingdom: Bacteria
Replicon accession: NC_009665
Strand:
Start bp: 4891901
End bp: 4892791
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 48%
IMG OID: 640832366
Product: formate dehydrogenase subunit FdhD
Protein accession: YP_001368280
Protein GI: 153002599
COG category: [C] Energy production and conversion
COG ID: [COG1526] Uncharacterized protein required for formate dehydrogenase activity
TIGRFAM ID: [TIGR00129] formate dehydrogenase family accessory protein FdhD
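
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 4891901
end_bp = 4892791

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 891
print(protein_length)  # 296
```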


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.015822
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCGG AAAACAGCAT GGGCACAGAT AAAGAAGTAG CGGCAGCTAA CCCCACTCAG 
AAGCCTCATC ATCAGTGTTC TTTTGTGAGA ACCCAAGCGG AAGTACCGTT AACCATTGCC
GTAAAAGCAG TGAATGAAGC TGGGGAAGTG CTAGATAAAT TTGTCGCCTG CGAACGTCCA
CTTACCGTTT ATTTAAACTG GCGTCCGATA GTGACGCTGA TGACCCTAGG GGCAAAACCT
GAGTCTCTGG CGTTAGGTTA TCTTAAGAAC CAAGGCTTTA TTTCGGATGT GAGCCTGCTG
GACTCTGTCA TCGTCGATTG GGATGTGAGT TCTGCGGCTG TGGTTACCCG TGAACAAACT
GCCGATCTCG ACGAGAAGCT CTCTGAAAAA ACCGTGACGT CAGGCTGTGG TCAAGGCACG
GTTTATGGCA GTTTCATGCA GGATCTAGAT AACATCAATT TACCTACACC GAGTTTGAAG
CAAAGCACGC TGTATAGCTT ATTGAAAAAT ATCAACGAAT ATAACGAAAC CTATAAGAAT
GCTGGCGCTG TGCATGGCTG TGGTTTGTGC GAAGACGATA GGATTATGGC CTTCGTTGAA
GATGTCGGCC GCCATAACGC CGTTGATACC TTGGCAGGGG ATATGTGGCT GACGCAGGAT
CGTGGTGATA ACAAGATTTT TTATACCACA GGCCGATTAA CCTCTGAAAT GGTGATTAAA
GTCGCCAAGA TGGGGATCCC CATTTTGCTA TCACGTAGCG GCGTCACTCA GATGGGCTTA
GCACTGGCGC AGCAGTTAGG CATTACTATT ATCGCCCGCG CTAAAGGTCG ACACTTTTTG
GTGTATCACG GCAGTGAAAA TCTGCAATTT GATGCCAATA CGGCTCCTTA G
 
Protein sequence
MISENSMGTD KEVAAANPTQ KPHHQCSFVR TQAEVPLTIA VKAVNEAGEV LDKFVACERP 
LTVYLNWRPI VTLMTLGAKP ESLALGYLKN QGFISDVSLL DSVIVDWDVS SAAVVTREQT
ADLDEKLSEK TVTSGCGQGT VYGSFMQDLD NINLPTPSLK QSTLYSLLKN INEYNETYKN
AGAVHGCGLC EDDRIMAFVE DVGRHNAVDT LAGDMWLTQD RGDNKIFYTT GRLTSEMVIK
VAKMGIPILL SRSGVTQMGL ALAQQLGITI IARAKGRHFL VYHGSENLQF DANTAP
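
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs mainly in which codons may act as start codons). The function name `translate` and the choice to translate only the first 30 bases and the last line of the gene are illustrative, not part of the record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the last line of the gene sequence, copied from above.
# The last line starts in frame because the preceding 840 bases are a multiple of 3.
first_bases = "ATGATTTCGG AAAACAGCAT GGGCACAGAT"
last_line = "GTGTATCACG GCAGTGAAAA TCTGCAATTT GATGCCAATA CGGCTCCTTA G"

print(translate(first_bases))  # MISENSMGTD
print(translate(last_line))    # VYHGSENLQFDANTAP
```

Both results agree with the start and end of the protein sequence listed above; the final TAG codon is the stop codon and contributes no residue.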