Gene TM1040_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0403 
Symbol 
ID4078797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp412303 
End bp413517 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content60% 
IMG OID638005698 
Productformate dehydrogenase gamma subunit 
Protein accessionYP_612398 
Protein GI99080244 
COG category[C] Energy production and conversion 
COG ID[COG2864] Cytochrome b subunit of formate dehydrogenase 
TIGRFAM ID[TIGR01583] formate dehydrogenase, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0151738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCC TGATTTTCGT ATTCGTCCTC AAATTGGCCT TGCTTTGCGG CCTTTCTGTC 
AACGCGCAGG AGGCGAGCCC AGCTGCGCCA GACCGCTCCG CCACCGGCGG CGCGCAGACG
CTCGAGGATA TTCTCGCGCG GCAACGTGGC GAGGATCTGG ACGACAGCTT CCGCAGCGAG
GCCACGGGCA ACCCCGAGCT TGCCGCCGGG ATTGAACAGC AGCTTGGCAC CCTTGGAGGC
CAGTCCGACC CCGAACTCTG GCGCGCATTG CGCTATAATT CTGCCGACAT CAGTGTTTCG
GCCGGCGGCG ATGTTGCGAC GGTTCTGGTG CAGGATGGCG GCATGAGCTG GCTGAGCTTC
CGCGAAGGAC CGCTGCGCCA ATATGGCGGG TGGCTCCTTT TAGGCACGAT TGCCGCGCTC
GCCGTCTTCT TTGTGCTGCG GGGCCGGATC GGTATCGATG CAGACAAGAC CGGACGCACC
GTGACCCGCT TTGCCTTTGT TGAGCGCATG GCCCATTGGA CGCTTGCGGG ATCGTTCATC
CTGCTGGGTA TCACCGGGCT TTTGACGCTC TTTGGGCGCG TCGCAATCCT GCCCTACCTG
GGCAAGGATG TGAATTCGGT GCTGCTCATC GGCTCCAAGT GGATCCACAA CTCCGTATCG
TGGGCCTTCA TGCTGGCGCT GATCGTGGTC TTTTTCATGT GGGTGGCGCA TAACATCCCA
AACAAGCTCG ATCTTGTCTG GCTCAAGCAG TTCGGTGGTA TCATCGGCAG CAAGCACCCG
CCCGCAAAGA AATTCAACGC AGGGCAGAAG GTGATCTTCT GGTCGGTGAT CTTGCTCGGG
GCCTCGATTT CCGTCTCTGG TCTGTCGCTG CTGTTTCCCT TCGAGATGCC GCTCTTTGCC
AAGACCTTTG CCGCTCTGAA CAGCACGGGA ATCCCGGAAT TGCTGGGCTT TGGGGTTCTT
CCCACCGAGC TTACGCCACA TGGCGAAATG CAGCTTGCAC AGCTCTGGCA CGCCATTGTC
GGCTTTGTCC TTATGGCGAT CATCATCGCG CATATTTATA TCGGCTCTGT CGGCATGGAA
GGCGCCTATG ATGCCATGGG GTCGGGCGAA GTGGATGAGG CCTGGGCCCA TCAGCACCAT
TCGATCTGGC TCGAAGAAGT GAAGGAAAAA GAGCGCGCAA AAGGCAGCGA CGCGAGCGCC
ACTCCGGCGG AGTGA
 
Protein sequence
MPRLIFVFVL KLALLCGLSV NAQEASPAAP DRSATGGAQT LEDILARQRG EDLDDSFRSE 
ATGNPELAAG IEQQLGTLGG QSDPELWRAL RYNSADISVS AGGDVATVLV QDGGMSWLSF
REGPLRQYGG WLLLGTIAAL AVFFVLRGRI GIDADKTGRT VTRFAFVERM AHWTLAGSFI
LLGITGLLTL FGRVAILPYL GKDVNSVLLI GSKWIHNSVS WAFMLALIVV FFMWVAHNIP
NKLDLVWLKQ FGGIIGSKHP PAKKFNAGQK VIFWSVILLG ASISVSGLSL LFPFEMPLFA
KTFAALNSTG IPELLGFGVL PTELTPHGEM QLAQLWHAIV GFVLMAIIIA HIYIGSVGME
GAYDAMGSGE VDEAWAHQHH SIWLEEVKEK ERAKGSDASA TPAE