Gene TM1040_2330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2330 
Symbol 
ID4078320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2450310 
End bp2451701 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content61% 
IMG OID638007652 
Productthreonine synthase 
Protein accessionYP_614324 
Protein GI99082170 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.351518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATA TCTCCACCCG CGGCCAAGCG CCCGAACTCA CCTTCGAAGA AGCCATGCTG 
ACCGGGCTTG CGCGCGACGG CGGGCTTTAT GTTCCGGCAG AAATCCCGAC GCTCTCGGCA
GAGGAAATCG CAGGCTTTGC CGGGCTGCCT TATGAGGAGG TCGCGTTTCG CGTGATGTGG
CCCTATGTGT CGGGGTCTTT CTCCGAAGAG GAATTCAAGG GCATCATCGC GCGCGCCTAT
GCCGGGTTTG AGCACGCCGC TCGCGCGCCG CTGAAACAGA TGGCGCCGAA CCACTTCCTG
TTGGAGCTCT TTCACGGCCC GACGTTGGCG TTCAAGGATT TCGCCATGCA GCTCATCGGT
CAGCTGTTTC AGGTCGCGCT CAAACGCCGG GGCGACAGCG TGACCATCGT GGGTGCCACT
TCTGGCGACA CCGGGTCCGC GGCGATCGAG GCGTTTCGCG GTCTGGACGC GGTCAACGTC
TTTATCATGT ATCCCCATGG CCGCGTCTCC GAGGTGCAGC GCCGCCAGAT GACCACACCG
CAGGACGCCA ATGTGCATGC CCTCGCGGTG GATGGAGACT TTGACGACTG CCAGGCTGCG
GTCAAAGACA TGTTCAACGA TTTTGACTTC CGCGATTCGG TGCATCTGGC GGGCGTGAAC
TCGATCAACT TTGCCCGCGT TTTGGCGCAG GTGGTCTATT ACTTCACCGC TGCCGTGGCC
TTGGGCGCAC CGCACCGCAA AGTGTCCTTC ACCGTGCCGA CCGGTAACTT TGGCGACATC
TTTGCGGGCT TTATCGCGCG CCAGATGGGG CTGCCGATCG ATCAGCTGGT GGTCGCCACC
AACCAGAACG ACATCCTGCA CCGCTGCCTC TCGGGCGAGG GCTATTTCAA AGGCGAGACC
ATCCCGTCGA TTTCGCCTTC TATGGATATT CAGGTCTCTT CGAACTTCGA GCGGGCCTTG
TTCTATGCCT ACGATCAGGA CGGCGCGGCT GTGGCGCAGC TGATGGACGA ACTGAAGACC
GGTGGTGGTT TTAACGTGAG CCAGGGGGCC ATGCAGGCGT TGAGCGAAAT CTACAGCTCA
GGCCGCGCTT CCGAGGAGGA GACTTCCGCC ACGATCAAAT CCGAACTCGC GGCCTCAGGA
GAGCTGCTTT GCCCACATGG GGCAGTTGGG GTGAAGGTCG CCAATGAACA CCTCAAGGAT
GGGGTGCCGA TGGTCACGCT GGCCACGGCG CATCCCGCAA AATTCCCGGC CGCGGTCGAG
GCGGCCTCGG AGGTGCATCC GCCTCTTCCC CCTCGCATGG CAGACCTGTA TGACAGATCG
GAGCGCGTGA CCCGGATCGC CAATGATCTC GGCGCGATTG AAGATCATAT CAGAAAGCAC
ATCGCCAATT GA
 
Protein sequence
MKYISTRGQA PELTFEEAML TGLARDGGLY VPAEIPTLSA EEIAGFAGLP YEEVAFRVMW 
PYVSGSFSEE EFKGIIARAY AGFEHAARAP LKQMAPNHFL LELFHGPTLA FKDFAMQLIG
QLFQVALKRR GDSVTIVGAT SGDTGSAAIE AFRGLDAVNV FIMYPHGRVS EVQRRQMTTP
QDANVHALAV DGDFDDCQAA VKDMFNDFDF RDSVHLAGVN SINFARVLAQ VVYYFTAAVA
LGAPHRKVSF TVPTGNFGDI FAGFIARQMG LPIDQLVVAT NQNDILHRCL SGEGYFKGET
IPSISPSMDI QVSSNFERAL FYAYDQDGAA VAQLMDELKT GGGFNVSQGA MQALSEIYSS
GRASEEETSA TIKSELAASG ELLCPHGAVG VKVANEHLKD GVPMVTLATA HPAKFPAAVE
AASEVHPPLP PRMADLYDRS ERVTRIANDL GAIEDHIRKH IAN