Gene TM1040_3375 details


Gene Information

Locus tag: TM1040_3375
Symbol:
ID: 4075274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 389009
End bp: 390259
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 60%
IMG OID: 638004883
Product: sarcosine oxidase beta subunit family protein
Protein accession: YP_611609
Protein GI: 99078351
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form
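
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 389009
END_BP = 390259
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 390259 - 389009 + 1 = 1251 and 1251 / 3 - 1 = 416.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```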


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.480932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTACT CTGCTTTGAG ACTGATCAAG GAAAGCCTGA CCGGACACAG GGGCTGGGGG 
CCTCAATGGC GCGACCCTGA CCCACAAGCG TCCTATGATT ACGTCATCAT CGGGGGCGGG
GGACACGGAT TGGCCACCGC TTACTATCTG GCCAAAGAGT TTCAGGGCCG CCGGATTGCG
GTTCTGGAAA AGGGCTGGAT TGGCGGTGGC AACGTCGGGC GCAACACGAC GATCATTCGC
TCCAACTACC TTCTGGACGG CAACGAGCCG TTCTACGAGT TCTCGCTGAA GCTTTGGGAA
GGGTTGGAGC AGGACCTGAA CTATAATGCC ATGGTAAGCC AGCGTGGCAT TCTCAACCTT
GTGCACACCG ATGCCCAGCG CGATGCCGCG CGGCGGCGCG GGAATGCGAT GATCCTGAAC
GGATCGGATG CGGAACTCCT CGACACTGAT GGCGTCCGCG CGCTCTATCC GTTCCTGAAT
TTCGAAAATG CCCGCTTCCC GATCAAGGGT GGCCTCCTGC ACCGGCGCGG TGGGACCGTG
CGGCATGACG CTGTTGCCTG GGGCTACGCC CGCGGCGCAG ATCAGCTGGG CGTGGACATC
ATCCAGAACT GCGAAGTCAC CGGCTTCAGG GTGGAAAACG GCCGCGTAAC AGGCGTTGAA
ACATCACGCG GACTGATTCG CGCTGCAAAA GTCGGCGTAT CCGTCGCGGG CAGCTCGAGC
CGCGTGATGG CGATGGCCGG AATGCGCTTG CCAATCGAAA GCCACGTGCT GCAGGCCTTT
GTGTCCGAGG GACTCAAACC CTTCATTCGG GGGGTCATCA CTTATGGCGC GGGACATTTC
TATTGCAGCC AATCCGACAA GGGCGGGCTG GTGTTTGGCG GCGATATAGA CGGCTACAAT
TCATATGCAC AACGCGGCAA CCTTCCGGTG GTCGAAGATG TCGTCGAAAG CGGCATGTCA
CTGATCCCCG GTCTGGGGCG CGCACGGCTG CTGCGCAGTT GGGGCGGCAT CATGGATATG
TCCATGGACG GCTCCCCCTT CATCGACAAG ACCCATATCG AAGGCCTCTA TTTCAACGGT
GGCTGGTGCT ATGGCGGCTT CAAGGCAACA CCCGCCGCAG GCTTTTGTTT TGCGCATCTC
CTGAAGACCG ACCGCCCACA TGAAACCGCC AAAGCCTATC GGCTCGACCG GTTCATGACG
GGGCACATGA TCGACGAAAA GGGCCAAGGC GCCCAGCCCA ACCTTCACTA A
 
Protein sequence
MRYSALRLIK ESLTGHRGWG PQWRDPDPQA SYDYVIIGGG GHGLATAYYL AKEFQGRRIA 
VLEKGWIGGG NVGRNTTIIR SNYLLDGNEP FYEFSLKLWE GLEQDLNYNA MVSQRGILNL
VHTDAQRDAA RRRGNAMILN GSDAELLDTD GVRALYPFLN FENARFPIKG GLLHRRGGTV
RHDAVAWGYA RGADQLGVDI IQNCEVTGFR VENGRVTGVE TSRGLIRAAK VGVSVAGSSS
RVMAMAGMRL PIESHVLQAF VSEGLKPFIR GVITYGAGHF YCSQSDKGGL VFGGDIDGYN
SYAQRGNLPV VEDVVESGMS LIPGLGRARL LRSWGGIMDM SMDGSPFIDK THIEGLYFNG
GWCYGGFKAT PAAGFCFAHL LKTDRPHETA KAYRLDRFMT GHMIDEKGQG AQPNLH
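
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to produce this record: it strips the whitespace that separates the 10-character blocks, computes GC content as a fraction, and translates codons until the first stop. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code, and it does not apply the table's alternative start codon rule (not needed here, since the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence give the first six residues.
print(translate("ATGCGTTACT CTGCTTTGAG"))  # MRYSAL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to both) is one way to check that the two listings agree.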