Gene TM1040_0648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0648 
Symbol 
ID4078161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp691865 
End bp693112 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content60% 
IMG OID638005945 
Productpeptidase T 
Protein accessionYP_612643 
Protein GI99080489 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.60889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA GCTTTGACAA CACCCTTCAA TCCCGTCTCG TGCGCTATGC TGCCATCGAC 
AGCCAGAGCG ATGAGAATTC CCCCACGGCA CCCAGCACCA AGATCCAGTT TGATATGTTG
CACTTGGTGC GCGACGAGTT GACCGAGATC GGAGCGCAAG ACGTGCAGCT GACCGATTAC
GGCGTTGTTC TGGCGACTAT CCCCGGCAGG TCGGAGGCCC CCACAGTGGG CTTCCTCGCC
CATGTCGACA CTGCCCCGCA GTTCAATGCC ACCGGTGTGA AACCTCGTGT GATCAAAGGG
TACAATGGCG GCGAGATCAG CTTTCCCGAT GATCCGGAGC TGGTGCTCTC TCCCGAGGCG
CACCCCTATC TGGCCGAGAA AATCGGCCAT GATCTGATCA CTGCGTCTGG CACCACATTG
CTTGGGGCGG ACGACAAGGC GGGGGTTGCG ATCATAATGA CGATGGCAGA GCACCTGCTG
CAAAACCCGG ATGCCGCCCG TCCAACAATC CGGATTGCCT TCACTCCGGA CGAAGAGATC
GGTCGCGGCG TGCAACCCCA GCTGGAGCGC GACCTCGGGG CGGATTTTGC CTATACGCTC
GATGGCGGGG AACTCGGCGA AGTGGAGTAC GAGAGCTTTT CGGCCGACCG CGCAGTTGTC
AAAATCACCG GCGTTTCCAT TCACCCGGGC CTCGCCAAGG AAAAGATGGT CAATGCCATC
CATCTTGCCT CCAAGATCAT CCAGACGCTG CCTCAGGCCA CGATGACGCC CGAGACAACA
GCGGACCGCG AGGGGTTCAT CCATGCCACC GACATGTTCG GGGGGTCTTC CGAGATGGAG
ATCCGCTTCA TCCTGCGCGA TTTTGAAATG GCCGATCTTG AGGCCAAGGG CGCCCTTCTA
CGCAGCGTCT GCGAGGCGGT TGCGGCAACC GAACCGCGCG CGGAGATCAC CTGCGAGATT
ACCCCGCAAT ATCGCAACAT GCGGTATTGG CTTGAAAAGG ACATGACCCC TGTCGATCTC
GCCCATGCCG CCTGCCGCGA TGTGGGACTG GAGCCGGTAT CGGTTCCGAT CCGCGGGGGC
ACCGATGGGT CTCGCCTGAC CGAATTCGGC ACGCCGACGC CCAATATCTT TACGGGGATG
CAATGCATTC ATGGCCCCTT GGAGTGGATC TCGGTTCAGG ATATGTCGCT GGCAACGCAG
ATGTGTCTGA GCCTCGCGGC GCGCGCCGCA ACGCTCGACA AGACCTAA
 
Protein sequence
MTDSFDNTLQ SRLVRYAAID SQSDENSPTA PSTKIQFDML HLVRDELTEI GAQDVQLTDY 
GVVLATIPGR SEAPTVGFLA HVDTAPQFNA TGVKPRVIKG YNGGEISFPD DPELVLSPEA
HPYLAEKIGH DLITASGTTL LGADDKAGVA IIMTMAEHLL QNPDAARPTI RIAFTPDEEI
GRGVQPQLER DLGADFAYTL DGGELGEVEY ESFSADRAVV KITGVSIHPG LAKEKMVNAI
HLASKIIQTL PQATMTPETT ADREGFIHAT DMFGGSSEME IRFILRDFEM ADLEAKGALL
RSVCEAVAAT EPRAEITCEI TPQYRNMRYW LEKDMTPVDL AHAACRDVGL EPVSVPIRGG
TDGSRLTEFG TPTPNIFTGM QCIHGPLEWI SVQDMSLATQ MCLSLAARAA TLDKT