Gene TM1040_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2667 
Symbol 
ID4077578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2802298 
End bp2803320 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content69% 
IMG OID638007991 
Productpeptidase S58, DmpA 
Protein accessionYP_614661 
Protein GI99082507 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCG GTCCCCGCAA TCTCATCACC GATGTCACCG GCCTGCGTGT TGGCAATGCC 
GCGGACGCGC GGTTGAAATC AGGCACCACG GTCCTGACCG CGGATCAGCC GTTCACGGCA
GCGGTGCATG TGATGGGGGG CGCCCCCGGC ACGCGAGAGA CCGATTTGCT CGCGCCCGAC
AAAACCGTGC CCGCCGTTGA CGCGCTCGTG CTCTCTGGCG GCTCGGCCTA TGGGCTGGAT
GCCTGCTCGG GCGTGGTGGA CGGGCTGCGC GCCATGGGGC GCGGGTTTCG TCTTGGTCCC
GCCATTGTGC CCATCTGCCC CGGCGCGATC ATCTTTGATC TGTTGAACGG CGGCGACAAG
GACTGGCGGG ACAATCCCTA TCCCGCGCTT GGGCGCGCCG CGCTCAGGGA CGCCAGTGCG
GAGTTTGCCC TCGGCACGGT GGGCGCGGGC ACCGGCGCGC TCACGGGTAT GCAAAAGGGC
GGGCTGGGGT CGGCCTCGCT GGTGCTCGAA AATGGCCTCA CCGTGGGGGC GCTGGTGGTG
GTGAACCCGA TTGGATCGGT GACCACCCCG GGCGAGCGCC ACTTCTGGGC GGCCCCGTTT
GAAATCGACG GAGAATTCGG CGGGCTTGGG CCTGACCCCA GCGCCGGGAT CGGGCGCAGC
CTGCGCAGCC GCAAGATGGA GGCCATGGCA GAGCTGGCAG GCGCAGCCGT CTCCTCGGAG
GGGTCCAACA CCACGATTGC GATTGTGGCC ACGGATGCGG CGCTGACCAA AGCCGAAGCC
ACCCGCATGG CGACCACCGC CCATGACGGC ATGGCGCGGG CCATCCTGCC GAGCCATGGG
CCACATGATG GCGATCTGGT CTTTGCAGCC GCCACCGGCG CGCAGTCCAT GACCGACCCC
GCAGCGGATA TGTTGGCGCT GTGCCATGCC GGATCGCTCT GCCTCGCGCG TGCGATCGCC
CGCGCGGTCC ATGCCGCCAC CCCGGCCGAG GGTGATATTC TGCCCTGCTG GTCAGACTCC
TGA
 
Protein sequence
MKPGPRNLIT DVTGLRVGNA ADARLKSGTT VLTADQPFTA AVHVMGGAPG TRETDLLAPD 
KTVPAVDALV LSGGSAYGLD ACSGVVDGLR AMGRGFRLGP AIVPICPGAI IFDLLNGGDK
DWRDNPYPAL GRAALRDASA EFALGTVGAG TGALTGMQKG GLGSASLVLE NGLTVGALVV
VNPIGSVTTP GERHFWAAPF EIDGEFGGLG PDPSAGIGRS LRSRKMEAMA ELAGAAVSSE
GSNTTIAIVA TDAALTKAEA TRMATTAHDG MARAILPSHG PHDGDLVFAA ATGAQSMTDP
AADMLALCHA GSLCLARAIA RAVHAATPAE GDILPCWSDS