Gene TM1040_3382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3382 
Symbol 
ID4075281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp397569 
End bp398549 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content58% 
IMG OID638004890 
Productpeptidase M19, renal dipeptidase 
Protein accessionYP_611616 
Protein GI99078358 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.290781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG ATGGTCTGCA ATACGCAAAC TGGTCGGAGA AGATCTTTCG CCAGCTTCGC 
GAAGGGGGCG TGGACGCGAT CCACGTCACC ATCGCCTATC ACGAGAACTT TCGTGAAACG
GTTCTGAACT TTGAAAAATG GAATCGATGG TTTGAGCAAT ACCCGGACCT GATCATGAAG
GGCCAGTGGG CGCAGGACAT CGACATCGCA CGCGAGACGG GCAGAACGGC CGTGTTTTTT
GGGTTCCAGA ATCCCTCGCC GATCGAGGAT GACATCGGCC TGGTCGAGAT CCTTCATAGT
CTCGGCGCCC GCTTCATGCA GCTGACCTAT AACAACCAGT CGCTACTGGC GACGGGCTGT
TACGAGGCCG AAGACATGGG CCTGACCCGG ATGGGCAAAC AGGTTGTAAA GGAAATGAAC
CGCGTCGGCC TCGTGATCGA CATGAGCCAT TCGTCGGATC GCTCCACCAT TGAGGCGGCG
GAATATTCCA CACGCCCCAT CGCGATCACC CATGCCAATC CGCACGCATG GTCTCCTGCC
CTGCGCAACA AGAAAGACGC GGTGATCCGC GCGGTTACCG AAAACGGCGG CATGTTCGGT
TTTTCGGTCT ATCCGCACCA CCTGAGGGAC AAATCCGACT GCACGCTGGA GAGTTTCTGC
GAAATGATCG CGCGCACTGC TGACACCTAT GGGGTGGAGC ATCTCGGAAT CGGCACTGAC
CTTTGCCAGG ACCAGCCCGA CAGTGTCGTG GAATGGATGC GCGTCGGGCG TTGGACGAAA
GAAATCGACT ACGGCGAAGG GTCCAAGTCC GCGCCCGGTT TCCCACGGAT GCCCAGCTGG
TTCGAGGACA ACAGAGACTT TGAGAACATC GAGCAGGGGC TTCGGTCCGT TGGCATGACG
ACCGGTGAAG TCGCCGCCAT CATGGGCGGC AACTGGTACC GCTTTTTTGC AGAAAGCTTT
GGGCCCAAGG CGGGAGGCTA A
 
Protein sequence
MRIDGLQYAN WSEKIFRQLR EGGVDAIHVT IAYHENFRET VLNFEKWNRW FEQYPDLIMK 
GQWAQDIDIA RETGRTAVFF GFQNPSPIED DIGLVEILHS LGARFMQLTY NNQSLLATGC
YEAEDMGLTR MGKQVVKEMN RVGLVIDMSH SSDRSTIEAA EYSTRPIAIT HANPHAWSPA
LRNKKDAVIR AVTENGGMFG FSVYPHHLRD KSDCTLESFC EMIARTADTY GVEHLGIGTD
LCQDQPDSVV EWMRVGRWTK EIDYGEGSKS APGFPRMPSW FEDNRDFENI EQGLRSVGMT
TGEVAAIMGG NWYRFFAESF GPKAGG