Gene TM1040_2701 details


Gene Information

Locus tag: TM1040_2701
Symbol:
ID: 4077008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2842959
End bp: 2844122
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 61%
IMG OID: 638008026
Product: D-3-hydroxyaspartate aldolase
Protein accession: YP_614695
Protein GI: 99082541
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3616] Predicted amino acid aldolase or racemase
TIGRFAM ID:
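
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2842959
end_bp = 2844122
gene_length_bp = 1164
protein_length_aa = 387


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions, 2844122 - 2842959 + 1 gives 1164 bp, and 1164 / 3 - 1 gives 387 aa, matching the listed gene and protein lengths.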


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.75052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.463179
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCAC CCGCAAATTT CGACAGCCTC GAAGTGGGCT TTGACGTCCC CGCCCTGCCC 
GGCATGGATG AGGCCGACAT CCAGACCCCC TGTCTGGTGC TCGACCTCGA CGCGCTGGAG
CGCAACATCA AGAAGATGGG CGATTACGCC CGCGCGCACG GCATGCGCCA CCGCGTGCAT
GGCAAGATGC ATAAATCGGT GGATGTGGCC AAACTCCAAG AGCGTCTTGG TGGCGCAATC
GGGGTCTGCT GCCAGAAGGT CTCTGAGGCC GAAGTCTTTG CACGAGGCGG CATAAAGGAC
GTGTTAGTGT CTAATCAGGT GCGCGATCCG GCCAAGATCG ACCGGCTGGC GCGGCTCCCG
AAACTGGGCG CGCGTACCAT CGTCTGTGTG GATGATCCGG CCAATGTCGC GGATCTGTCA
GCAGCGGCAC AGAAGCATGA CACCGAGATC GAGTGCTTTG TGGAGATCGA CTGTGGCGCT
GGACGCTGCG GCGTGACCAC CACGGAAGAC GTGGTCGAGA TCGCGCAGGC CATTGATGCC
GCCCCGGGGC TAAAGTTCAC CGGCATCCAA GCCTACCAGG GCGCCATGCA GCACCTCGAC
AGCTACGAGG CGCGCAAGGA AAAGCTCGAC GTTGCAATCG CTCAGGTGCG TGATGCGGTC
GAGGGGCTGA AGGCTGCGGA TCTTGCGCCG GAATTGGTCT CTGGAGGCGG TACAGGCTCT
TACTACTTCG AGTCTAACTC AGGGGTTTAT AACGAATTGC AGTGTGGCTC CTACGCGTTC
ATGGATGCCG ACTATGGCCG AATTCTGGAC CAGGACGGCA AGCGGATCGA CCAGGGCGAG
TGGGAGAATG CCTTCTTCAT TCTGACCCAG GTCATGAGCC ACGCCAAAGC TGACAAGGCG
ATCTGTGATG CGGGCCTCAA GGCGCAATCC GTGGACAGCG GACTGCCGTT CATCTTTGGT
CGCACGGATG TCGAATACGT CAAATGCTCG GACGAGCATG GGGTGATAGC TGACCCCGAT
GGCGTGCTGA AGGTGGGCGA GAAACTGAAG CTGGTCCCCG GCCATTGTGA TCCCACCGCG
AATGTCCACG ACTGGTACGT CGGCGTCCGC AACGGCAAGG TCGAAACCCT CTGGCCAGTG
TCCGCGCGCG GCAAGGCCTA CTGA
 
Protein sequence
MNAPANFDSL EVGFDVPALP GMDEADIQTP CLVLDLDALE RNIKKMGDYA RAHGMRHRVH 
GKMHKSVDVA KLQERLGGAI GVCCQKVSEA EVFARGGIKD VLVSNQVRDP AKIDRLARLP
KLGARTIVCV DDPANVADLS AAAQKHDTEI ECFVEIDCGA GRCGVTTTED VVEIAQAIDA
APGLKFTGIQ AYQGAMQHLD SYEARKEKLD VAIAQVRDAV EGLKAADLAP ELVSGGGTGS
YYFESNSGVY NELQCGSYAF MDADYGRILD QDGKRIDQGE WENAFFILTQ VMSHAKADKA
ICDAGLKAQS VDSGLPFIFG RTDVEYVKCS DEHGVIADPD GVLKVGEKLK LVPGHCDPTA
NVHDWYVGVR NGKVETLWPV SARGKAY
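
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is not the database's own method. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helpers `clean` and `translate` are hypothetical names introduced for this example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA in frame 1; stop at the first stop codon if to_stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAATGCAC CCGCAAATTT CGACAGCCTC"
print(translate(first_groups))  # MNAPANFDSL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should, under the same assumptions, reproduce the whole 387-residue protein, ending before the terminal TGA stop codon.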