Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0641 |
Symbol | |
ID | 4076128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 683825 |
End bp | 685609 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005938 |
Product | peptidase M24 |
Protein accession | YP_612636 |
Protein GI | 99080482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.117815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCAAA ACTTTGATGT CACGGCCCGT CCCGAGCAGG GCCCGGCGCG CCTTGCCGCG CTTCGCGCTG AGATGGAGCG AGACAAGATC GACGGTTTTT TGGTCCCGCG CGCCGATGCG CATCAAGGCG AATACGTCGC CCCACGGGAT GAACGTCTGG CGTGGCTCAC CGGCTTTACC GGCTCTGCGG GATTCTGCGC CGTGCTGCCG CATATTGCTG GCGTGTTTAT CGACGGGCGC TATCGCACAC AGGTCAAAGG CCAAGTCGCG GATGTCTATA CCCCTGTTCC TTGGCCGGAT GTGACTTTGG GCGATTGGCT GGTGGAGCAA CTGCCAGAGG GCGGGATTGT CGCCTATGAC CCCTGGCTGC ATTCCCTGCA GGAGATCCGC GACCTGACCG AGCGGCTCGT CTCGTCGGAT ATTTCACTGG TGGAAAGCGA CAATCTAGTA GACCGCATCT GGCCCGACCA GCCAGCGCCC CCGATGCAAC CGGCTCGGGC GCATTCGGAG GACTATGCCG GAGAGAGCGC CGAGAAGAAG GCTCAGCGCC TGGCCGAGGG CCTGCGTAAA AGCGGACAGT CCGCAGCGGT CATCACTCTT CCAGACAGCA TCATGTGGCT CCTGAATATC CGTGGTTCTG ACATTCCGCG CAATCCGGTC GCCCATGCTT TTGCGATCCT GCATGATGAC GCCCGAGTGG ACCTGTTTAT GGCAGCAGAG AAGCTCTCTG AGCTCGTATT GGGCGCGCAT GTGACCCTAC ACGCGCCTGA TCGCTTTCTC GAGGCCACAG CCGGCCTCAA TGGTCAGGTC GCGGTGGACG CGCGCAGCCT GCCACAGGCT GTTGCACGGG TTTTGGGCGA CAGGCTGGCG GCGGTCGGAG ACCCCTGCGC CCTGCCAAAG GCCCGCAAGA ACGCCGCCGA GATCGCAGGC AGCGCGGCCG CACATCTGCG CGATGGGGCT GCCGTTGTCG AAACGCTGGC TTGGCTCGAT ACGCAGGAAC CGGGCACGAT TACCGAAATC GACGTGGTCA AGACACTCGA AGGGTTCCGC GCGGCAGATC CCGCGTTGCG TGACATCAGC TTTGAAACCA TCGCAGGGAC AGGCGCCAAT GGCGCAATCA TGCATTACCG TGTGACACAT GATACCAATG CGACGCTCCA AGAGGGTCAT CTTCTGGTGC TCGACAGCGG CGGGCAATAT CTCGATGGCA CCACCGACAT CACCCGCACC ATCGCCATTG GATCGCCGGG CCGTGAGGAA GCCGAAGCCT TTACGCGCGT CCTGCAGGGC ATGATCGCGG TCTCCCGGTT GCGTTGGCCC GAGGGGCGGT CCGGGCGCGA ACTTGAGGCT ATCGGCCGCC TGCCTCTCTG GATGGCTGGA CAGGATTTCA ACCATGGGCT TGGTCATGGC GTCGGCGCCT TCCTCAGCGT ACATGAAGGA CCGCAGGGTC TTTCCCGCAT TAATACGGTA CCGCTTGAGC CGGGCATGAT CCTGTCCAAC GAGCCGGGCT ACTACCGGGA GGGCGCCTTT GGCATCCGGA TTGAAAACCT CGTAGTGGTA GAAGAGGCCC CAGCCCTTGA CACCGCCGAC CCAGACCGCA AGATGCTCGC GTGGCGCACG CTGACGTTTG CGCCCATCGA CCGCCGTTTG GTGGTGCCCG AGATGCTGAG CTCGGGGGAG CGCGAGTGGC TCAATAGCTA TCACGCAGAG GTAAACCGCA CTATCGCGCC GCGCGTCAGC GCCGCTGCGG CAGAGTGGTT GAACGCGGCC TGCGCGCCGC TGTGA
|
Protein sequence | MFQNFDVTAR PEQGPARLAA LRAEMERDKI DGFLVPRADA HQGEYVAPRD ERLAWLTGFT GSAGFCAVLP HIAGVFIDGR YRTQVKGQVA DVYTPVPWPD VTLGDWLVEQ LPEGGIVAYD PWLHSLQEIR DLTERLVSSD ISLVESDNLV DRIWPDQPAP PMQPARAHSE DYAGESAEKK AQRLAEGLRK SGQSAAVITL PDSIMWLLNI RGSDIPRNPV AHAFAILHDD ARVDLFMAAE KLSELVLGAH VTLHAPDRFL EATAGLNGQV AVDARSLPQA VARVLGDRLA AVGDPCALPK ARKNAAEIAG SAAAHLRDGA AVVETLAWLD TQEPGTITEI DVVKTLEGFR AADPALRDIS FETIAGTGAN GAIMHYRVTH DTNATLQEGH LLVLDSGGQY LDGTTDITRT IAIGSPGREE AEAFTRVLQG MIAVSRLRWP EGRSGRELEA IGRLPLWMAG QDFNHGLGHG VGAFLSVHEG PQGLSRINTV PLEPGMILSN EPGYYREGAF GIRIENLVVV EEAPALDTAD PDRKMLAWRT LTFAPIDRRL VVPEMLSSGE REWLNSYHAE VNRTIAPRVS AAAAEWLNAA CAPL
|
| |