Gene TM1040_2337 details


Gene Information

Locus tag: TM1040_2337
Symbol:
ID: 4078327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2456281
End bp: 2457702
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 65%
IMG OID: 638007659
Product: microcin-processing peptidase 2
Protein accession: YP_614331
Protein GI: 99082177
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
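
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2456281
END_BP = 2457702
GENE_LENGTH_BP = 1422
PROTEIN_LENGTH_AA = 473


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2457702 - 2456281 + 1 gives 1422 bp, and 1422 / 3 - 1 gives 473 aa, which agrees with the listed gene and protein lengths.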


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.652541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACAA CCGCTTTTTC CCCGTTTGAA ACCACACTTC CCGAAGATGA GGCGCTGCCG 
CTCCTGCGTG ACGCACTGGC AGGTGCCGAC GATGGCGAGA TCTTTGCCGA GCGCACAAAA
TCAGAGGCAT TGGTATTCGA CGACGGCCGT CTGCGTACGG CGAGCTATGA TGCTGCCGAA
GGATTCGGGC TGCGCGCGGT GCGGGGCGAA GTGGCGGGGT ATGCCCATTC GACCACCATG
TCGATCTCGG CGCTGCGCCG CGCGGCCGAA ACCGCACGGC TCGCAGTGGG CGCTGGCGGC
GGCACCATGG CCCCTGCCCC GCAGGCCACC AATCAGAAAC TCTATGGCGA TCTGGATCCG
ATCGCGGCGC AGGCCTTTCC CGTGAAGGTC GAGACCCTGC GTGAGATCGA CAGTTTTGCG
CGCGACCTCG ATCCGCGCGT CGTACAGGTC TCGGCCACGC TGGCAGCATC CTTGCAGGAA
ATCGAGATCC TGCGCGCCGA TGGCACCCGC GTGCGCGACG TGCGACCGAT GACGCGCGTG
AATGTCTCGA TCATCGTCGA GGACGGCGGA CGGCGCGAGA GTGGCACTGC GGGCGGTGGC
GGTCGGGTTG GCCTTGATGG GCTGATCGCG CCCGAGGACT GGCAGGCCAA AGCGCGTGAG
GCGCTGCGAA TCGCACTGGT GAACCTCGAC GCGGAACCTG CGCCCGCCGG CGAGCTTGAC
GTGGTACTTG GCCCCGGCTG GCCCGGCATC CTGCTGCACG AGGCGATCGG ACACGGGCTG
GAGGGCGATT TTAATCGGAA GGGATCCTCG GCATTTGCCG GGCTCATGGG ACAGCGCATC
GCAGCCCCCG GCGTTACCGT GCTGGACGAT GGCACCATTC CGGACCGGCG CGGTTCGATC
ACCGTGGACG ACGAGGGCAC GCCAAGCCAG AAGACCACAT TGATCGAGGA CGGCATCCTC
GTAGGGTACA TGCAGGATCG CCAGAACGCG CGCCTGATGG GCGTGGAGCC CACCGGTAAC
GGGCGCCGTC AAAGCTATGC ACACGCGCCG ATGCCGCGGA TGACCAACAC CTATATGCTC
GGCGGTGAGG CGACGCCCGA GGATCTGGTC AAAGAGGTCA AGGACGGAAT CTGGGCCGTC
GGCTTTGGCG GGGGACAGGT GGATATCACC AACGGCAAAT TCGTATTCTC CTGCACCGAA
GCCTACCGCG TCAAAGACGG CAAGGTCGGC GCCCCCGTCA AAGGCGCCAC GCTGATCGGA
GACGGCGCCA CTGCGCTGCA GCAAATCCGC GGGCTCGGCA ATGACATGGC GCTTGACCCC
GGGATGGGGA ACTGCGGCAA ACAAGGCCAA TGGGTACCTG TCGGCGTGGG CCAGCCCAGC
GTGCTCATGG GCGGATTGAC GGTCGGCGGA TCTGCGACCT GA
 
Protein sequence
MDTTAFSPFE TTLPEDEALP LLRDALAGAD DGEIFAERTK SEALVFDDGR LRTASYDAAE 
GFGLRAVRGE VAGYAHSTTM SISALRRAAE TARLAVGAGG GTMAPAPQAT NQKLYGDLDP
IAAQAFPVKV ETLREIDSFA RDLDPRVVQV SATLAASLQE IEILRADGTR VRDVRPMTRV
NVSIIVEDGG RRESGTAGGG GRVGLDGLIA PEDWQAKARE ALRIALVNLD AEPAPAGELD
VVLGPGWPGI LLHEAIGHGL EGDFNRKGSS AFAGLMGQRI AAPGVTVLDD GTIPDRRGSI
TVDDEGTPSQ KTTLIEDGIL VGYMQDRQNA RLMGVEPTGN GRRQSYAHAP MPRMTNTYML
GGEATPEDLV KEVKDGIWAV GFGGGQVDIT NGKFVFSCTE AYRVKDGKVG APVKGATLIG
DGATALQQIR GLGNDMALDP GMGNCGKQGQ WVPVGVGQPS VLMGGLTVGG SAT
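
The gene and protein sequences above are printed in blocks of 10 characters, so whitespace has to be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the amino acid assignments of translation table 11, which are the same as those of the standard code; alternative start codons are not handled here.

```python
from itertools import product

# Amino acid assignments for translation table 11 (identical to the
# standard code), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, dropping one trailing stop codon if present."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate("ATGGATACAA CCGCTTTTTC CCCGTTTGAA"))  # prints MDTTAFSPFE
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence listed in this record.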