Gene TM1040_0569 details


Gene Information

Locus tag: TM1040_0569
Symbol:
ID: 4076134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 605508
End bp: 607328
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 58%
IMG OID: 638005866
Product: peptidase M3B, oligoendopeptidase-related clade 3
Protein accession: YP_612564
Protein GI: 99080410
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR02290] oligoendopeptidase, pepF/M3 family
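
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 605508
end_bp = 607328

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1821
print(protein_length_aa)  # 606
```

Under these assumptions the computed values agree with the Gene Length (1821 bp) and Protein Length (606 aa) fields.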


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0425519
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000722694
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCCAAC TCCCTTTCCC TGTGCGCGAT GCCAATGCCT CTTCCGGAGC CGGTGATCTT 
GGCAAATTGC CGGAGTGGGA TCTGAGCGAT CTCTACGCAG GCGAAGACGC TGCGGAACTT
TCCCGCGATC TGGACTGGCT GCAGGGTGAA TGCGCTGCCT TTGCCGCCGA CTACGAGGGC
AAGCTTGCAG AGCTCGACGC CGAGGGGCTT CTGACCTGCG TCCATCGCAA CGAAAAAATC
AACAATATCG CCGGGCGCAT CATGTCCTAT GCGGGCCTGC GCTATTACCA GCTGACAACC
GACGCGGACC GCGCCAAATT CCTCTCTGAC GTGCAGGAGA AAGTCACGGT CTTCACCACG
CCGCTGGTGT TCTTCACGCT GGAAATCAAC CGCATCGAGG ATGCCAAACT CGACGCGCTC
TTTGAGGCAA ACGCCGATCT GGCGCGCTAC AAACCCGTTT TTGACCGGAT CCGCGCGATG
AAGCCCTATC AGCTCTCGGA TGAGCTGGAG CGTTTTATGC ACGACCTCGG GATTGTGGGC
GATGCCTGGG AAAAGCTCTT TGACGAGACC ATCGCAGGGC TGACCTTTGA GATCGAGGGC
GAAGAGCTCG GCATCGAGGC GACGCTCAAC TTCCTGACCG AGCAGGACCG CAGCAAGCGC
GAAGCCGCAG CGCGCGAACT TGCCCGCGTC TTTGCCGACA ACATCAAGAT CTTTGCCCGC
GTTCACAACA CGCAGGCCAA AGAGAAAGAG ATCATTGACC GCTGGCGCGG CATGCCGAGC
CCGCAGATGG GCCGGCACCT CTCGAACGAT GTCGAGCCCG AAGTGGTCGA AGCTCTGCGC
GAGGCCGTAG TGGCCGCCTA CCCCAAGCTT TCGCACCGCT ACTATGAGCT CAAACGCAAA
TGGCTCGGTC TCGATCGCAT GCAGGTCTGG GATCGCAACG CGCCTCTGCC GATGGAAACC
ACCCGCGTGG TGGACTGGGA GGAGGCCCGC GCAACCGTGA TGGAGGCCTA TGAGGCCTTT
GATCCACGCA TGGGCGAGCT GGCGCAGCCG TTTTTCGACA AGGGCTGGAT CGACGCAGGC
GTCAAACCCG GCAAAGCCCC CGGTGCATTT GCTCATCCAA CAGTGACCAA TGTCCATCCG
TATGTGATGC TGAACTACCT GGGCAAACCG CGTGACGTGA TGACGCTGGC GCATGAACTT
GGCCACGGTG TTCATCAGGT TCTTGCCGCA GATCAGGGCG AGATGCTCTC TTCGACACCC
CTGACCCTTG CGGAAACCGC ATCGGTCTTT GGCGAGATGC TCACTTTCCG CAAGATGCTC
GAAAAGGCCC AGACCAAAGA AGAGCGCAAG GTGCTCTTGG CAGGCAAGGT CGAGGACATG
ATCAACACGG TCGTGCGCCA GATCGCCTTT TATGACTTTG AGTGCAAACT ACACGCCGCA
CGCGCCGAAG GTGAGCTTAC GCCCGAAGAC ATCAACGCCC TCTGGATGAG TGTGCAAGCG
GAGTCCCTGG GCGAGTCCTT TGACTTCATG GAAGGGTATG AGACCTTCTG GGCCTATATT
CCGCACTTCG TCCACTCGCC CTTCTACGTC TATGCCTATG CATTTGGCGA TGGCCTCGTG
AACGCGCTTT ACTCGGTCTA TGCCGAAGGG GCCGAGGGAT TTGAGGACAA GTATTTCGAC
ATGCTCAAGG CTGGCGGGTC CAAACATCAC AAAGAGCTTT TGGCACCATT TGGTCTTGAT
GCCTCTGACC CCAAGTTCTG GGACAAAGGC CTGTCGATGA TCTCTGGCCT GATTGACGAG
CTTGAGGCAA TGGAAGCTTG A
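
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the function name `gc_content` is hypothetical. It strips the spaces and line breaks used in the 10-base block layout before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C fraction of a nucleotide sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, kept in its 10-base blocks.
first_line = "ATGTTCCAAC TCCCTTTCCC TGTGCGCGAT GCCAATGCCT CTTCCGGAGC CGGTGATCTT"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content reported in the record.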
 
Protein sequence
MFQLPFPVRD ANASSGAGDL GKLPEWDLSD LYAGEDAAEL SRDLDWLQGE CAAFAADYEG 
KLAELDAEGL LTCVHRNEKI NNIAGRIMSY AGLRYYQLTT DADRAKFLSD VQEKVTVFTT
PLVFFTLEIN RIEDAKLDAL FEANADLARY KPVFDRIRAM KPYQLSDELE RFMHDLGIVG
DAWEKLFDET IAGLTFEIEG EELGIEATLN FLTEQDRSKR EAAARELARV FADNIKIFAR
VHNTQAKEKE IIDRWRGMPS PQMGRHLSND VEPEVVEALR EAVVAAYPKL SHRYYELKRK
WLGLDRMQVW DRNAPLPMET TRVVDWEEAR ATVMEAYEAF DPRMGELAQP FFDKGWIDAG
VKPGKAPGAF AHPTVTNVHP YVMLNYLGKP RDVMTLAHEL GHGVHQVLAA DQGEMLSSTP
LTLAETASVF GEMLTFRKML EKAQTKEERK VLLAGKVEDM INTVVRQIAF YDFECKLHAA
RAEGELTPED INALWMSVQA ESLGESFDFM EGYETFWAYI PHFVHSPFYV YAYAFGDGLV
NALYSVYAEG AEGFEDKYFD MLKAGGSKHH KELLAPFGLD ASDPKFWDKG LSMISGLIDE
LEAMEA
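
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a simplified illustration, not the database's own procedure: it builds the codon table from the NCBI table 11 amino-acid string (codons ordered with bases T, C, A, G), reads the sequence in frame from the first base, and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI translation table 11 amino acids, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGTTCCAAC TCCCTTTCCC TGTGCGCGAT"))  # MFQLPFPVRD
print(translate("CTTGAGGCAA TGGAAGCTTG A"))           # LEAMEA
```

Both fragments match the corresponding ends of the protein sequence listed in this record.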