Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1299 |
Symbol | |
ID | 4078498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1390120 |
End bp | 1392111 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006607 |
Product | peptidase U35, phage prohead HK97 |
Protein accession | YP_613294 |
Protein GI | 99081140 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.284846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCGCG AAGTCACAGC GGAGCAGATC AACGCGCGCG GCGAGGGCGG TGCGATGCGC CGCATGGGAG AGGTGCGCGA AATCAACGTC GAGGCGCGCA CTGTCGAGCT TGCCTTCTCG AGCACGACCC CGGTGCGGCG CTGGTTTGGC GATGAGGTGC TTTCCCATGA CGCCGAGGCC GTGGTTCTGG ACCGTCTGCT CGACGGCGGG GCAGTATTGG TCGGGCACAA TTGGGACGAT CAGGTCGGCG TTGTGCAAAG TGCGCGCGTC GATTCCGACG GCGTCGGGCG CGCTGTTGTG CGGTTCGGCA AGAGCGCCCG CGCGAGCGAG ATCTTTCAGG ACATCGTCGA CGGCATTCGG CAGCACGTCT CGGTTGGCTA CCGGGTGATC ACGATCAGCG AGGAAATACG GGAAGGTCAG CCGAACCTTA TCACGGTCAC GCGCTGGGAG CCGTTCGAGA TTTCGGTCGT ACCGGTGCCA GCTGATCCGA CAGTCGGCAT CGGTCGCGCG CTGGAAAATC CGCCAGAGGC GGGCGGGGCG GATCGTGGCC AAACTGGCGA AGAGAATGCG GGCGCGGTGG CCGAGCCCAA CGATACAGGA CAAAGGGAAA CACAGATGAA AACCATCATC ACCCGCGACG CCGAGGGCAA TCTTGTCCGG GCCAAGGTCG ACGAAAACGA TCAGATCATC GAAGTGATCG AGGTGCTCGA ACGTGCAGGC GCAGCGGAAA CTGCCATGCT GTCGCGTCTG CAGGAGCAGG AAGCGGCCCG CGTGCGCGAA CTGACCGAGT TGGGTCGCGA ATACGACGCG CCGGAGCTCG CAACAGAAAT GATTGCCGGG CGTCATGGCG TGACCGACAT GCGCGAACGC CTGCTTGATC ACCTACACCA GCGCAGCAAT GAAAGTCGCC AAATTTCTGA GCGTTCCAGC ATTGGCCTGA CCGACAGCGA AACAGAGCAG TTTTCGTTCC TGCGCGCGAT CCGGGCGCTG GCAAATCCGA CAGACCGCAG CGCCCAGGAA GCGGCTGCTT TCGAGTTCGA GGTCTCCGAT GCTGCCGCCG AGGCGCAGGG GCGCGATGCG CAGGGCGTAA TGGTGCCGAT GGACGTGCTG ATGCGTGCGC CGCTGAACAC GGGTTCCGGT GGCGCGACCG CTGCCGATAC CGGCGGCAAC ACCATCGCAA ACCCGTTGCT GACGCAGAGC TTTATTCAGA TGCTGCGCAA CCGTACGATC CTTTTGCAGC TCGCGACTCC GCTGATGGGT CTGGTGGGCA ACCCTGATAT CCCGACGCAG GAAGGTGGTG CGACCGGCTA CTGGATCGGT GAGGATATCG AGGCGACAGA GGATCTGCTG TCTCTGGGTC AGCGCCAGTT TTCTCCGAAA ACTGTTGCGG CCTATTCGGA GATCACGCGC CGCACTCTCA AGCAATCCAG TTTGGATATC GAGGCGCTGG TGCGTAGTGA CCTTGCGCTC GCATTGGCAA CCTCGCTGGA TTTTGCGGGT TTCTATGGCA CCGGCGCCAA CGATCAGCCG CTGGGTATCG CGAATACGAG CGGCGTGAAT GTGGTCGACT TTGGCGGCGC AGCGTCTGGT GGCGGATCCG CCCTCCCGAC CTGGGCCGAA GTGATCCAGA TGGAGAGCGA GATTTCTGCT GCAAATGCCG ATGTGAATAG CATGGCGTAT GTGCAGAACG CCAAGATGCG TGGTCACTTC AAGAGCACGC AGAAATTCAG CGGCACAAAC GGTGCGCCCA TCTGGGAGAG CGACAACACC GTGAACGGAT ATCGCGGCGA AGTTACCAAC CAGATTAAAG ATGGTGACGT GTTCCACGGT GATTTCGCGA ACGTCCTGGT TGGCATGTGG GGTGGTCTGG ATATTACGGT GGACCCGTAC ACCCACAGCC GCCGCGGGCG CCTGCGCATC GTGACGATGC AGGACGCGGA TTATGTTCTG CGTCACCCGG CGGGCCTCTG CTTCGGCACT GACGCCAGCT AA
|
Protein sequence | MTREVTAEQI NARGEGGAMR RMGEVREINV EARTVELAFS STTPVRRWFG DEVLSHDAEA VVLDRLLDGG AVLVGHNWDD QVGVVQSARV DSDGVGRAVV RFGKSARASE IFQDIVDGIR QHVSVGYRVI TISEEIREGQ PNLITVTRWE PFEISVVPVP ADPTVGIGRA LENPPEAGGA DRGQTGEENA GAVAEPNDTG QRETQMKTII TRDAEGNLVR AKVDENDQII EVIEVLERAG AAETAMLSRL QEQEAARVRE LTELGREYDA PELATEMIAG RHGVTDMRER LLDHLHQRSN ESRQISERSS IGLTDSETEQ FSFLRAIRAL ANPTDRSAQE AAAFEFEVSD AAAEAQGRDA QGVMVPMDVL MRAPLNTGSG GATAADTGGN TIANPLLTQS FIQMLRNRTI LLQLATPLMG LVGNPDIPTQ EGGATGYWIG EDIEATEDLL SLGQRQFSPK TVAAYSEITR RTLKQSSLDI EALVRSDLAL ALATSLDFAG FYGTGANDQP LGIANTSGVN VVDFGGAASG GGSALPTWAE VIQMESEISA ANADVNSMAY VQNAKMRGHF KSTQKFSGTN GAPIWESDNT VNGYRGEVTN QIKDGDVFHG DFANVLVGMW GGLDITVDPY THSRRGRLRI VTMQDADYVL RHPAGLCFGT DAS
|
| |