Gene TM1040_0969 details

Gene Information

Locus tag: TM1040_0969
Symbol:
ID: 4077265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1035985
End bp: 1037280
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 61%
IMG OID: 638006272
Product: sarcosine oxidase beta subunit family protein
Protein accession: YP_612964
Protein GI: 99080810
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form
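
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes (this is not stated in the record) that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1035985, 1037280)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Under these assumptions the computed values match the reported Gene Length (1296 bp) and Protein Length (431 aa).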


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCCC GGTCATATCT TGAGGCCAGA CACGAAGCAT TGGATTCGCT CATGAAACGC 
TATTCAGCCT TTGCAGTGGC GCGGGAAGGC CTTCGCTACC ACAGCGGATG GGAACGCGCC
TGGCGCTCCC CGGAACCCAA ACGTCACTAT GACGTCATCA TCGTAGGCGC GGGTGGCCAT
GGGCTCGCTA CGGCCTATTA CCTTGGGAAA AACTTTGGCA TCACCAATGT TGCGGTGATC
GAAAAGGGTT GGCTGGGGGG CGGCAATACG GGCCGCAACA CCACCATCAT CCGTTCGAAC
TACCTGCAGG ATCCCTCTGC CGCGATCTAC GAGAAATCGC GCAGCCTCTA TGAGGATCTG
TCGCAGGACT TCAACTACAA CATCATGTTC AGCCCGCGCG GCGTGATCAT GCTGGCGCAG
ACCGAGCACG AGGTGCGTGG TTACAAGCGC ACCGCCCATG CCAATGCGCT CCAGGGCGTG
TCGACCGAAT GGATCGAACC CGCCCGCGTG AAGGAACTGG TGCCGATCAT CAACCTCGAA
GGTCCGCGCT ATCCGGTCCT TGGCGGGCTC TGGCAAGCGC GTGGCGGTAC CGCCCGTCAC
GATGCGGTGG CCTGGGGCTA TGCGCGGGCC TGCTCGGCGA TGGGCATGGA CATCATCCAG
AAATGCGAAG TCACCAATGT TCGGACTGAA AACGGCCGCG TGGTAGGTGT CGACACCACC
AAAGGGGCGA TCGACTGCGA CAAGCTGGGC ATGGTGGTTG CGGGCAACTG TTCGGTGCTG
TCTGAAATGG CGGGCTTCCG TCTGCCGGTG GAATCGGTGG CGCTGCAGGC GCTGGTCTCC
GAGCCGATCA AACCCTGCAT GGACGTGGTC GTGATGGCCA ACACCGTGCA TGGCTACATG
TCGCAATCCG ACAAGGGCGA GATGGTCATT GGTGGCGGCA CCGACGGCTA CAACAACTAC
ACCCAGCGCG GTTCTTTCCA CCACATCGAG GAAACCGTGC GCGCCCTCAA CGAGACTTTC
CCGATGGTGT CGCGCCTCAA GATGCTGCGC CAATGGGGTG GGATCGTGGA TGTAACCGGC
GACCGCTCGC CGCTGATTTC CAAAACGCCG GTTCAGAACT GTTTTGTCAA CGCTGGCTGG
GGCACCGGCG GCTTCAAGGC GATCCCCGGC TCGGGCTGGG CGATGGCGGA ACTGATGGCG
ACAGGGCATT CCAACCTCGC GGAAGAGTTC TCCATGATGC GCTTCAAAGA AGGCAAATTC
ATCGACGAGA GCGTCGCAGC AGGGGTGGCA CACTGA
 
Protein sequence
MRSRSYLEAR HEALDSLMKR YSAFAVAREG LRYHSGWERA WRSPEPKRHY DVIIVGAGGH 
GLATAYYLGK NFGITNVAVI EKGWLGGGNT GRNTTIIRSN YLQDPSAAIY EKSRSLYEDL
SQDFNYNIMF SPRGVIMLAQ TEHEVRGYKR TAHANALQGV STEWIEPARV KELVPIINLE
GPRYPVLGGL WQARGGTARH DAVAWGYARA CSAMGMDIIQ KCEVTNVRTE NGRVVGVDTT
KGAIDCDKLG MVVAGNCSVL SEMAGFRLPV ESVALQALVS EPIKPCMDVV VMANTVHGYM
SQSDKGEMVI GGGTDGYNNY TQRGSFHHIE ETVRALNETF PMVSRLKMLR QWGGIVDVTG
DRSPLISKTP VQNCFVNAGW GTGGFKAIPG SGWAMAELMA TGHSNLAEEF SMMRFKEGKF
IDESVAAGVA H
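
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method: it assumes the sequence is given on the coding strand, strips the spaces and line breaks used in the display above, and uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch ignores). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("ATGCGATCCC GGTCATATCT TGAGGCCAGA"))  # MRSRSYLEAR
```

Passing the full gene sequence to the same function should yield the full protein sequence, with translation ending at the final TGA stop codon.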