Gene TM1040_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1962 
Symbol 
ID4077146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2065987 
End bp2067360 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content64% 
IMG OID638007277 
ProductL-serine ammonia-lyase / 2-hydroxymethylglutarate dehydratase 
Protein accessionYP_613956 
Protein GI99081802 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1760] L-serine deaminase 
TIGRFAM ID[TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.447397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCTCT CCGTTTTTGA CATGTTCAAA GTGGGCATCG GCCCGTCCTC CTCGCACACC 
ATGGGCCCGA TGGTGGCCGC GGCACGCTTT CTCGACATGA TGCGGGCCTC GCCCTTTGAC
TTTCACGGGC TGCGCGCATC GCTGCACGGC TCGCTGGCCT TTACCGGCGT CGGCCATGCC
ACCGACCGCG CCACCATTCT CGGGCTTGCT GGCTTTTTGC CCGACACCTA TGACGACGAC
AAAGCCGAGG CCGCGCTTGC GGCGATCACC GAGACCAAGA CCATCGTGCT CGAGGATCTC
GGCACGCTGC GCTTTGACCC CAAGGCGGAT ATGATATTTG ACTATGATCA CGCGCTTCCG
GGCCACGCCA ACGGCATGGT TCTGATGGCG CTGGATGCGC AGGGCGACGT GACCTTGCGG
CAGGTGTTCT ATTCCGTTGG CGGTGGCTTT GTGCTGACCG AGGAAGAGCT GGCACAGGGC
AAGGCCACGG ATGAAGGCGA CCCCGTCCCC TACCCGTTCA AATCCGCCGC CGAGATGCTC
GAGATGGCAA AGACCTCCGG GCTCTCGATC GCCCAGATGA AACGCGCCAA TGAAGTCTCG
CGCGGCTGCG AGCAGAGTTT TGCCAAGGGT ACACAGCGCC TGTGGCAGGT AATGAACGAC
TGTATCAACC GCGGGCTGGA GCGTGACGGC ATCCTGCCCG GCGGTCTGAG CGTGCGCCGC
CGCGCCAAGG GTATCTATGA TGCGCTGATG GCCGAGCGCG GCATGAACCT GACCGCGCCG
CACACCATCA ACGACTGGAT GAGCGTCTAT GCTATGGCCG TGAACGAGGA AAACGCCGCA
GGCGGACAGG TCGTGACGGC CCCCACCAAT GGCGCGGCGG GCGTCCTGCC AGCGGTGCTG
CGCTATTACC TTGATCACGT GCCGGGCGCG TCGGAAAAAC ACATCGAGGA TTTCCTGCTG
ACCGCAGCCG CCATTGGCGG GCTGGTCAAA TACAACGCCT CGATCTCTGG CGCCGAAGCG
GGTTGTCAGG CCGAAGTCGG CTCTGCCTCT GCCATGGCGG CGGCGGGGCT CTGTGCGGTG
ATGGGTGGCA CGCCGGAACA GGTGGAGAAC GCGGCCGAGA TTGCGCTCGA ACACCACCTC
GGCATGACCT GCGACCCGGT CAAAGGGTTG GTTCAGGTGC CCTGCATCGA GCGCAACGGT
CTCGGCGCGA TCAAGGCGGT TTCGGCAGCG TCCCTGGCGC TGCGCGGCGA CGGGCAGCAT
TTTGTGCCGC TGGATGCCGT CATCGAAACC ATGCGCCAGA CCGGGGCCGA CATGCACGAG
AAATACAAGG AAACCTCGCT TGGGGGCCTC GCCGTCAACG TCCCCAACTG CTGA
 
Protein sequence
MFLSVFDMFK VGIGPSSSHT MGPMVAAARF LDMMRASPFD FHGLRASLHG SLAFTGVGHA 
TDRATILGLA GFLPDTYDDD KAEAALAAIT ETKTIVLEDL GTLRFDPKAD MIFDYDHALP
GHANGMVLMA LDAQGDVTLR QVFYSVGGGF VLTEEELAQG KATDEGDPVP YPFKSAAEML
EMAKTSGLSI AQMKRANEVS RGCEQSFAKG TQRLWQVMND CINRGLERDG ILPGGLSVRR
RAKGIYDALM AERGMNLTAP HTINDWMSVY AMAVNEENAA GGQVVTAPTN GAAGVLPAVL
RYYLDHVPGA SEKHIEDFLL TAAAIGGLVK YNASISGAEA GCQAEVGSAS AMAAAGLCAV
MGGTPEQVEN AAEIALEHHL GMTCDPVKGL VQVPCIERNG LGAIKAVSAA SLALRGDGQH
FVPLDAVIET MRQTGADMHE KYKETSLGGL AVNVPNC