Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3044 |
Symbol | |
ID | 4075138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 13418 |
End bp | 14599 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004545 |
Product | hypothetical protein |
Protein accession | YP_611280 |
Protein GI | 99078022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAGT TTACGGAAAC AACGCAGGTC AGCCTGTCGG TCCGATTAAA GAACGCGTTC AAAGGCATCG GCCTCGGGAT CAGCTTTATC GGCATTGCTC TCTATTTTCT GTTCTGGAAC GAAGGCAATG CCGTGCGCAC AGCGCGGGCG CTGGACGAGG GCGCGGGGCA AGTTGTGTCG TTGGACAGCG CAACATTGGA CCCCACGTTC GAAGCCCGTC TCGTCCATAT CAGTGGCCCC GCAAAGCTTA AAGCAGCGCT TGTTGATGCC GCGCTTGGAG TGGAGGCCCC GGCGCAAACG GTGCGCCTCG AACGTATCGT GGAGCAATTT GCCTGGATCG AAGAAACGCA AACCTGGACC GACACCAAAC TCGGAGGAGG GCAGGACAAA ACGACCACAT ACACCTATCG GATGGATTGG ACCGAAACCC CCGCGAGCGG GGCCGCGTTT CGAGTGTCCG AAGGTCACAT GAACCCTCCG ATGCCGATCC GTTCCAAAAT CCTGCGCCAG CAAGACGCAA CGGTCGGCGC TTACCGCGTG TCAGAAGAGA TCTCTGATCT GGGCGGCGCG ACACCGGTGA TATTGACCGA AACACAGGCT GCTGAGATCG CAGAGGCTCT GCCGCTTTCT CAAACGGCCA AGCTGGTTGC CGGGCAAGTT GTGTTTGGTG AGACCGTCGC GCGCCCGGCA CTTGGGGACA TCCGACTGCG CTACCAAGCC GCCAGAATTG ACAGCGCCAG CGTCATTGGC CTGCAACGCG GCAATGCTCT GGTGCCCTAT ATCGCGCAGA ACGGTCGCAA GATCCACTTG CTCACGGAAG GAATAAAAAC CGCCGAAGAG ATGTTTGAGA CCGCGCAGCG CGCCAACACG GCCAAGACAT GGATGCTGCG CATCGGCTTG CTGGTTCTGC TCTTTCTAGG CTTCAAAGCG CTCTTTGGCG TTGTGGATGT GCTTGCAAGC ATTCTGCCCG TTCTGGGCTG GGTCTCGTCG TCGGTCACCT CGCTTATCAG CGTTGCATTG GCGTTCTGCC TTGGTGGTCT CACGATGGCA ACGGCCTGGT TCTATTATCG CCCAATCGTG TCTCTGGCGC TGATCGCAGT TGCCTTGGCT GTTGGTCTCG TTGGCGCGCT CTGGCTGCGC TCATCCGCAA AACATGCACC TCATCCCCCC GGAACCACGT GA
|
Protein sequence | MSQFTETTQV SLSVRLKNAF KGIGLGISFI GIALYFLFWN EGNAVRTARA LDEGAGQVVS LDSATLDPTF EARLVHISGP AKLKAALVDA ALGVEAPAQT VRLERIVEQF AWIEETQTWT DTKLGGGQDK TTTYTYRMDW TETPASGAAF RVSEGHMNPP MPIRSKILRQ QDATVGAYRV SEEISDLGGA TPVILTETQA AEIAEALPLS QTAKLVAGQV VFGETVARPA LGDIRLRYQA ARIDSASVIG LQRGNALVPY IAQNGRKIHL LTEGIKTAEE MFETAQRANT AKTWMLRIGL LVLLFLGFKA LFGVVDVLAS ILPVLGWVSS SVTSLISVAL AFCLGGLTMA TAWFYYRPIV SLALIAVALA VGLVGALWLR SSAKHAPHPP GTT
|
| |