Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2639 |
Symbol | |
ID | 4077942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2773042 |
End bp | 2774463 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638007963 |
Product | microcin-processing peptidase 1 |
Protein accession | YP_614633 |
Protein GI | 99082479 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.818625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGA GCCCTTGCGA TACGGCGGCG CTCTGGCTAG CTCTGGTTCA ATGCCACCCA TTTGCGAAGG TGCCCATGAC ACAGACTCCT GAAACGCTTT GCCACGCCCT CCTTGATGCC GCCCAAAAGG CCGGGGCCGA TTCTGCCGAC GCCATGGCCG CCGAGGGCAG CTCGCTCTCG ATCGAGGTGC GCGAGGGCGC GCTGGAACAT GCAGAGCGCT CCGAAGGGGT GGACATCGGG CTGCGGGTCT TTGTCGGCCA GCGTCAGGCG CAGGTGTCCT CCTCCGATAC CCGCCCCGAA ACCCTGACCG CGATGGCCGA ACGCGCCGTG GCCATGGCCA AAGAAGCGCC CGAAGATCCC TATGCCGGGC TTGCTGACCC CGCGCAGCTG GCCAAATCCT GGGATCTCGA CGCCCTTGAG ATGGCCGACC CCAGCGCCGA GCCTGCGCCC GATCAACTGC AACAGGACGC GCTGGCCGCC GAAAGCGCCT GCGCCGCCAT CGACGGCATT TCTCAGGTCC AGTCCGCCGC GGCGGGCTAT GGGCGTCATG ACATCCACAT GGCCGCGAGC AACGGGTTCT CCGGGGGCTA TGCGCGCACC AGCCGCTCGA TCTCCTGTGT GGGGATTGCG GGCACCGGCA CCGGCATGGA GCGCGACTAT GACGGCGACA GCCGCATCTA TCAAACCGAT CTGCGCAGCG CCGAAGAGAT CGGGCGCACC GCTGGCGAGC GCGCCATCGA ACGTGTGAAC GCCCGCCGCC CCAAAACTGG CGCCTATCCC GTGCTCTTTG ACGAGCGGAT CTCCTCATCC CTCATCGGGC ATCTTCTGGG TGCCGCCAAT GGCGCGTCGG TGGCGCGCGG CTCCTCGTGG CTCAAGGACA GTCTCGGCGC GCAGATCCTG CCCGAGGCCT TCTCGGTCAT CGAGGACCCC CTGCGCCCCC GCGTTTCAGG CTCGCGCCCC TTTGATGGCG AAGGCCTGCC CACGCAGCGC CGCGCGATCG TCGACAAGGG CGTGCTGACC GGCTGGACCA TGGATCTGGC TTCGGCGCGC AAACTTGGCC TTGAGAGCAC AGGCAACGCC GCGCGTGGCA TCGGGTCGGT GCCGTCGCCC TCCAACTGGA ACATCGCTCT GACCCAGGGG CAACAGACCC GCGAAGAGCT GCTGCGCGAC ATGGGCACCG GGCTTCTGGT CACCTCGATG ATCGGCTCCA CCATCAACCC CAACACGGGC GACTACTCGC GCGGCGCTTC GGGCTTCTGG GTGGAGAACG GCGAGATCCA GTATCCGGTC AACGAGGTCA CGATTGCCGG GAACCTCCTC GATATGCTGA AAACGCTGGT CGCCGCCAAC GACGCCCGCA CACATCTGTC GCGGGTGGTG CCATCGCTTC TGGTAGAGGG ACTGACCCTT GCCGGAGAAT GA
|
Protein sequence | MAQSPCDTAA LWLALVQCHP FAKVPMTQTP ETLCHALLDA AQKAGADSAD AMAAEGSSLS IEVREGALEH AERSEGVDIG LRVFVGQRQA QVSSSDTRPE TLTAMAERAV AMAKEAPEDP YAGLADPAQL AKSWDLDALE MADPSAEPAP DQLQQDALAA ESACAAIDGI SQVQSAAAGY GRHDIHMAAS NGFSGGYART SRSISCVGIA GTGTGMERDY DGDSRIYQTD LRSAEEIGRT AGERAIERVN ARRPKTGAYP VLFDERISSS LIGHLLGAAN GASVARGSSW LKDSLGAQIL PEAFSVIEDP LRPRVSGSRP FDGEGLPTQR RAIVDKGVLT GWTMDLASAR KLGLESTGNA ARGIGSVPSP SNWNIALTQG QQTREELLRD MGTGLLVTSM IGSTINPNTG DYSRGASGFW VENGEIQYPV NEVTIAGNLL DMLKTLVAAN DARTHLSRVV PSLLVEGLTL AGE
|
| |