Gene TM1040_0286 details


Gene Information

Locus tag: TM1040_0286
Symbol:
ID: 4077421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 291466
End bp: 292560
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 65%
IMG OID: 638005580
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_612281
Protein GI: 99080127
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
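
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(291466, 292560)
print(gene_len, protein_len)  # 1095 364
```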


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.969107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCC TCTCTCAGAC ACCGACCCAT GATCTGCGCA TCACCGACAT GCAGGAATTG 
ATCTGCCCCG AAGCCCTCGC GGTGAAACAC CCGCTGACAG ATGCCGCGCG CGAAACCGTG
CTCTCGGCGC GCGCCAGCAT CCAGAAGATC CTGCATGGCG CCGACGACCG GCTGGTCGTC
GTGGTCGGCC CCTGTTCGAT CCACGACCCG GAGGCCGCGC TGGACTATGC GCGCCGTCTT
GCGCCGCTGC GCGCCGAGCT GGGCGATGCG CTTGAGATCG TGATGCGGGT CTACTTTGAA
AAACCGCGCA CCATCGCGGG CTGGAAGGGG CTGATCAACG ACCCCAACCT TGATGGGTCT
TTCCGCATCA ACAAGGGGTT GTCGGTCGCC CGCAAGCTCT GCCTGGATCT GAGCGAAATG
GGCCTGCCCG TGGGGACCGA ATTCCTCGAT GCCTCGGTGC CGCAATACAT CAGTGATCTG
GTGAGCTGGG CCGCGATTGG CGCCCGCACC ACCGAGAGCC AGATCCACCG CGAAATGGCT
TCGGGCCTGA GCTGCCCGGT GGGCTTCAAG AACGGCACCC GCGGCAATGT GCAGATCGCC
ATTGACGCGG TGCGCTCGGC GGCCACACCG CATCATTTCA TGGCGCTGGC CCCCTCGGGT
CTCGCGGCGA TTGCGGCGAC GGCCGGAAAC CCGGATTGCC ACATCATCCT GCGCGGCGGC
GGTGGTACCA ACTTTGATGC CGAGAGCGTG GATTCAGCCT GCAAAAAGGC CGAAGCCGAT
GGCATCCGTC CGCAGGTGAT GATCGACGCA AGCCACGCCA ACTCTGCCAA GGATCCCGCC
AAACAGCCCG AGGTGCTCTC GGATGTGGCC GGCCAGATGG CACAGGGTGA GACCCGCATC
ACCGGCATCA TGATCGAAAG CCACCTCGAA CAGGGTCGTC AGGATCTGCC CAAGGACGGG
GACCTGTCGA AACTCACCTA TGGTCAGTCG ATCACCGACG GCTGCATCGG CTGGGAGCAA
ACCGAGGCCG AGCTGCGCAA ACTGGCCCAG GCGGTCAAAA CACGCCGCAC GCAGGGCGCC
CGCCTGGCGG GTTGA
 
Protein sequence
MTTLSQTPTH DLRITDMQEL ICPEALAVKH PLTDAARETV LSARASIQKI LHGADDRLVV 
VVGPCSIHDP EAALDYARRL APLRAELGDA LEIVMRVYFE KPRTIAGWKG LINDPNLDGS
FRINKGLSVA RKLCLDLSEM GLPVGTEFLD ASVPQYISDL VSWAAIGART TESQIHREMA
SGLSCPVGFK NGTRGNVQIA IDAVRSAATP HHFMALAPSG LAAIAATAGN PDCHIILRGG
GGTNFDAESV DSACKKAEAD GIRPQVMIDA SHANSAKDPA KQPEVLSDVA GQMAQGETRI
TGIMIESHLE QGRQDLPKDG DLSKLTYGQS ITDGCIGWEQ TEAELRKLAQ AVKTRRTQGA
RLAG
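
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the pipeline used to produce this record. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments
# (assumed here to be shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both begin on a codon boundary.
first_line = "ATGACCACCC TCTCTCAGAC ACCGACCCAT GATCTGCGCA TCACCGACAT GCAGGAATTG"
last_line = "CGCCTGGCGG GTTGA"
print(translate(first_line))  # MTTLSQTPTHDLRITDMQEL
print(translate(last_line))   # RLAG
```

The two outputs match the first 20 and last 4 residues of the protein sequence listed above.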