Gene TM1040_3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3495 
Symbol 
ID4075174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp531293 
End bp532558 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content62% 
IMG OID638005010 
Productdiaminopimelate decarboxylase 
Protein accessionYP_611729 
Protein GI99078471 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.844313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCATT TTCTCTATCG CGATGGCGCT TTGTACGCCG AAGATGTCCC CGTAGCCGAG 
ATTGCCGCTA CGGTGGGCAC GCCGTTCTAC GTCTACTCCA CCGCGACGCT CCTGCGCCAT
TTCCGTCTCT TTGACGAGGC GCTTGAGGGC ACCGACCATC TGGTTTGTTA CGCGATGAAG
GCCGCCTCCA ATCAGGCGAT CCTAAAGACA CTCGCGGCGG CGGGCGCAGG CATGGATGTG
GTGAGCGAAG GCGAATACCG CCGCGCCAAG GCCGCAGGCG TGCCGGGCGA CAAGATCGTG
TTTTCCGGTG TCGGCAAGAC CGCCGAAGAG ATCCGCACCG CGCTCACCGG GGGCATTCGC
CAGTTCAACG TCGAATCCGA GCCCGAGATG GACGTGATCA ATGCCGTTGC GCTCGAGCTT
GGTGTCACCG CGCCGATCAC CGTGCGGGTG AACCCGGATG TGGATGCAAA GACCCACGCC
AAGATCGCGA CCGGTAAATC CGAGAACAAA TTCGGCATCC CCATCGCCAA GGCGCGCGCG
GTCTATGCCC ATGCCGCCAG CCTGCCGGGC CTTGAGGTGA TCGGGATCGA TGTTCACATC
GGCTCGCAAC TCACGGATCT TGAGCCCTTC CGCCTTGCCT ATCAAAAGGT TGCGGAGCTG
ACACAGGCTC TGCGCGCGGA CGGTCACGAT ATTCGCCGCC TTGATCTTGG GGGCGGTCTG
GGCATCCCCT ATACCCGCTC CAATGAGGCC CCGCCGCTGC CGGTGGAATA TGGCCAGATG
ATCAAGGAAG AGCTCGGTCA TCTGGGCTGC GAAATCGAGA TCGAACCGGG CCGTCTGGTG
GCGGGCAATG CGGGGCTGAT GGTCTCTAAG GTGATCTACA TCAAAGAGGG CGAAGGCCGC
GATTTCCTGA TCCTCGACGG GGCCATGAAC GACCTCATCC GCCCAGCGAT GTATGAGGCC
CATCACGACA TCATCCCCGT GGTGGAACCG ACCCCCGGTC TCGAACCGCA ACCCTATGAC
ATCGTGGGCC CGGTCTGCGA AAGCGGCGAC ACCTTTGCCA AACAACGCCT GATGCCGCCG
CTTGCTGCGG GGGATCTGGT GGCGTTTCGC AGTGCCGGGG CTTATGGCGC GGTGATGTCC
AGCGAATACA ACTCGCGCCC CCTCATCCCC GAGGTGCTGG TCCACGGCGA TCAATTTGCA
GTCATCCGGC AGCGTCCGAC CTTTGACGAG ATGATAAATC GCGATACCAT CCCAGAGTGG
CTGTAA
 
Protein sequence
MDHFLYRDGA LYAEDVPVAE IAATVGTPFY VYSTATLLRH FRLFDEALEG TDHLVCYAMK 
AASNQAILKT LAAAGAGMDV VSEGEYRRAK AAGVPGDKIV FSGVGKTAEE IRTALTGGIR
QFNVESEPEM DVINAVALEL GVTAPITVRV NPDVDAKTHA KIATGKSENK FGIPIAKARA
VYAHAASLPG LEVIGIDVHI GSQLTDLEPF RLAYQKVAEL TQALRADGHD IRRLDLGGGL
GIPYTRSNEA PPLPVEYGQM IKEELGHLGC EIEIEPGRLV AGNAGLMVSK VIYIKEGEGR
DFLILDGAMN DLIRPAMYEA HHDIIPVVEP TPGLEPQPYD IVGPVCESGD TFAKQRLMPP
LAAGDLVAFR SAGAYGAVMS SEYNSRPLIP EVLVHGDQFA VIRQRPTFDE MINRDTIPEW
L