Gene TM1040_2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2885 
Symbol 
ID4076419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3054365 
End bp3055591 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content59% 
IMG OID638008214 
Productthreonine dehydratase 
Protein accessionYP_614879 
Protein GI99082725 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR02079] threonine dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.134652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACT TCAGAACCCA GGCCAAAGCG GCCGAGGCGG CGATGCGCGA TGTCTTCCCG 
CCCACGCCGC TGCAACGCAA CGAACTCCTG TCGCAGCGCT TTGGTGCGGA GATTTACCTC
AAACGCGAGG ATCTCAGCCC GGTGCGCTCC TACAAGATTC GCGGCGCGCT CAACGCGATG
CGAAAGCAGA CCAGCAAGGA TCTGTTTGTC TGTGCTTCGG CGGGAAATCA CGCGCAGGGC
ATGGCTTACA TGTGTCGGCA AATGGACAAA CGGGGTGTGA TCTTTATGCC GGTGACCACA
CCTCAGCAGA AGATCCAGAA GACCCGCATG TTTGGCGGTG ACAACGTCGA GGTGCATCTG
ATCGGGGATT ACTTCGACGA TACATTGTCC GCCGCGCAGG CGTGGTGCGC AGAAAAAGGA
GGGTATTTCC TCTCTCCTTT TGACGACGCG GATGTCATCG AAGGTCAGGC ATCAATCGCA
GTGGAAATCG AGGCGCAACT CGGCAAGGCG CCGGATCATA TCGTTCTACC GGTGGGCGGC
GGGGGTATGT CCTCTGGCGT GGTTCGGTAT TTTGGCGAGG ATGTGCATGC GCTTCTTGTG
GAGCCTGAGG GCGGCGCCTG CCTGAAGGCG GCGCTTGAGG CAGGTCATCC CACACCGCTC
AACCGGGTGG ATACCTTTGT GGACGGCGCC GCCGTTGGCA AAATTGGCGA GCGGCCTTTT
GATATTTTGC GCAGCGTGCC GTTGCCGGAT GTCCTGACTG TGTCAGAGGA CCGGATCTGC
ACCACCATCC TCGAGATGCT CAATGTCGAA GGTATCGTTC TGGAACCTGC AGGCGCGCTC
GCCGTCGAGG CGCTGGGCGA CCTGCGCACA TGGATTAAGG GTAAGACCGT GGTTTGTCTG
ACTTCCGGCG GGAACTTTGA TTTCGAGCGA CTGCCAGAGG TCAAGGAACG GGCGCAGCGC
TATTCCGGGG TGAAGAAATA CTTCCTGCTG CGCCTGCCGC AACGTCCCGG CGCGCTCAAG
GAATTCCTGA ACATTCTTGG GCCGGATGAT GACATCGCAC GTTTTGAATA CATGAAGAAG
TCTGCGCGCA ACTTCGGCTC GGTCCTGATC GGGATCGAGA CCAAGCGCCC AGAGAACTTC
GCGCCGCTGT TTGCGCAACT GGATGCGGCG GGTTTCACCT ACACCGACAT CACAAATGAC
GAGACACTGG CGCAGTTCGT GATCTGA
 
Protein sequence
MTNFRTQAKA AEAAMRDVFP PTPLQRNELL SQRFGAEIYL KREDLSPVRS YKIRGALNAM 
RKQTSKDLFV CASAGNHAQG MAYMCRQMDK RGVIFMPVTT PQQKIQKTRM FGGDNVEVHL
IGDYFDDTLS AAQAWCAEKG GYFLSPFDDA DVIEGQASIA VEIEAQLGKA PDHIVLPVGG
GGMSSGVVRY FGEDVHALLV EPEGGACLKA ALEAGHPTPL NRVDTFVDGA AVGKIGERPF
DILRSVPLPD VLTVSEDRIC TTILEMLNVE GIVLEPAGAL AVEALGDLRT WIKGKTVVCL
TSGGNFDFER LPEVKERAQR YSGVKKYFLL RLPQRPGALK EFLNILGPDD DIARFEYMKK
SARNFGSVLI GIETKRPENF APLFAQLDAA GFTYTDITND ETLAQFVI