Gene Information |
Locus tag | TM1040_3768 |
Symbol | |
ID | 4074940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 6566 |
End bp | 7546 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638004421 |
Product | hypothetical protein |
Protein accession | YP_611163 |
Protein GI | 99077904 |
COG category | |
COG ID | |
TIGRFAM ID | |
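
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6566, 7546)       # Start bp and End bp from the record
print(length, protein_length(length))  # 981 326
```

Under these assumptions the computed values match the Gene Length (981 bp) and Protein Length (326 aa) fields.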
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.689278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.921006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCG TTATCCTGCC CTCACAAATG ACCGCACACC AACGCAGAGA GGCTTGGTAT GGTGAAGAAA TCCCCCACGC TCGTTACTTG CGTGACTATG ATTTCCCACT GAAGGAAGCT CTATTTTTTT GCTTGAGCGA ACTCGAAGGG ATTCTGGGGG CAGAAGCGAT CGCCCCCAAA ATCGCTGCCA TTGTTCCGGG AAACCAACCA ACGGATTGGG ATTGGAGATC GCTACCTTTC GATAGCAGCG TGTCGCCGAG TGCGAGCGGG TGGGACTGCG CCAAACGTCT TGACGACGCA GGACTATATG GGCTTTTGGG TGTGCGACCT GCGAATGTCC CGTTTGAAGC CCGAAAGAGC TGGGTTGCGA AACTCCCCCT AGAACTAGAC GACTGGCGGC AAAACGTGCG ACTGGGCAAA GATGCAAACA ACATATCATT GATTATTGAT TTGGCCCTGT CTCGGCACGC GATGGACACT GGACTCAGTC TTGCCTCCAT AACCGACGAG GACTTGAACG AAGAGGGTGA GGTGAGCGTT CCCGCTCTAG CAGTCTTTGG CGGCGTTTCC GAAGGACGAA TTCGTAACGT ACTGAGCAGC GGGGAAAGCT GCCTTGTCCG AAAATCCGGT CAGGCTGTCA CGGCGAGCAG TGCCAAGGAG TGGCTTGCAG GACGCAAAGA GTTTTACCAG TCCATTTGGG ATTTGCCCGA AGGCGAGAAA CCCGAACCAA AAGCTCGGAA TTTTTCAGGT GAGGTTCTCT TTGTTCCGGT GGCTTCGGAC GGCAGCACGT TTAACCCAGA GCTGAAACAC AAAGGCAAAT ATACGGTCGG AGCAAAGGGG CAAGAAATCC AGTTTGATCA ATTCCAAGAT GCCCTCAACG CCCTTCAAAA GATGTCAACG CCCCGCTGGC GCCGCCCGAA TGCGGCAGGC AACTGGTGCA TCGTCTCCGG CCGCGACTGG AAGCGGATCG AAAAGAAGTA A
|
Protein sequence | MKPVILPSQM TAHQRREAWY GEEIPHARYL RDYDFPLKEA LFFCLSELEG ILGAEAIAPK IAAIVPGNQP TDWDWRSLPF DSSVSPSASG WDCAKRLDDA GLYGLLGVRP ANVPFEARKS WVAKLPLELD DWRQNVRLGK DANNISLIID LALSRHAMDT GLSLASITDE DLNEEGEVSV PALAVFGGVS EGRIRNVLSS GESCLVRKSG QAVTASSAKE WLAGRKEFYQ SIWDLPEGEK PEPKARNFSG EVLFVPVASD GSTFNPELKH KGKYTVGAKG QEIQFDQFQD ALNALQKMST PRWRRPNAAG NWCIVSGRDW KRIEKK
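
Both sequences are displayed in blocks of 10 characters separated by spaces, so the blanks have to be removed before any computation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed from the gene sequence. It assumes GC content is defined as the fraction of G and C bases over the full sequence length, which may differ from how the database derives its rounded value. The helper names are hypothetical.

```python
def ungroup(seq: str) -> str:
    """Remove the display whitespace between sequence blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First three blocks of the gene sequence, as displayed in the record.
sample = "ATGAAGCCCG TTATCCTGCC CTCACAAATG"
print(len(ungroup(sample)), gc_fraction(sample))
```

Passing the full Gene sequence field to `gc_fraction` and the Protein sequence field to `ungroup` gives values that can be compared with the GC content, Gene Length, and Protein Length fields.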