Gene Information |
Locus tag | TM1040_3581 |
Symbol | |
ID | 4075509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 631198 |
End bp | 632346 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005101 |
Product | hypothetical protein |
Protein accession | YP_611812 |
Protein GI | 99078554 |
COG category | |
COG ID | |
TIGRFAM ID | |
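The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(631198, 632346)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both values match the Gene Length and Protein Length rows of this record.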
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.106267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.283892 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGG ATATCCTGCA TTCGGGCTTT GATGGGCTTA AATTCACTGT CGAGACCGAT ATCCCGCCCG AGCTGCGCAC GGCACTGGCT GAGGCCAAGG CGCAGGCAAT CCAGACCAAT GCCGAGACCG TTATGGAATT TGGCTCTGTG GCTCTCTCAG TGCGCCGTAC AGGCGGTTCG GCCTTTTCTG CCCATACTGG GGAGTATGGG GCCGAGTGGT ACTTTCTCGA CCCGGAAAAC CGCCCTGCAA ACAATCCCGG CATTACCGTG GACTTTCGCG CCTTCCTTCT AGCAACTGGC GGGCTGGACG CCGCAGAGAA ACACTTTCGC ACCTGCATGG ACGCCTTCGG CATTCGTTAT GCCGATCATC TCTTGCGCGT GAGCCGTGTG GATTATGCCA TCGACTTTCT GGCCCCTTGG TTTGAACCAG ACCGCGAGGC TCTGGTGGTG CCACCCGGCA CACGTGTTCA GGAACACACC GGTATTGATG AAACAGAAAC CCATGCCACC GGTGCGCGCG TCACCGGCCT GCGCGCCGGA GCCGTCGCCA ACCGGCAGTT GGTGATCTAC GACAAGCGAC AAGAGGTTAT GCAAAAGGGC AAGCTGGGCT GGCTCACCAT CTGGAACGAC GCCCGCGCCC AGTTGAACCG TCCGCCCCTC GACCTCACAG ACCGGATGAC CAGCCAAGTC TGGCGCTTTG AGCTGCGCAT GGGATCCAAG CAACTGCGCA ACCGCTGGGA AATGCGGTCA TGGCAAGACC TACGCGATAT GGTCGGAGAC GCCTACGCCG AGTTCTGCGA AAAGATCCGC TACACCTGCC CCACCACCGA CAGCAACCGC GCCCGCTGGC CAACACATGA CCTTTGGCGC GAGGTCGCGA GCGTGATCGC GAATGACCTA TATGAGAATT GCTCTGGCGT GTTGCCAAGC GAGGTGATCG AGACCAACCG GGCCGAACAC ATGCGCATGC TGGACCGGCA AATCCTTGGC CTTCTGGTGT CCCGTGCAGC AGCGTCAGAG GTTCAGCCGC ATGAGTTCGC GGAGTTTCTA GATACGCATA TAGAAGCGAT TGAACGGATG TCGGAAGAAC ACGCAACACC ACTGGCGGAA CGGATTAGGA AGGCGACAGA GCGGTATCGA TTCAAATAG
|
Protein sequence | MEADILHSGF DGLKFTVETD IPPELRTALA EAKAQAIQTN AETVMEFGSV ALSVRRTGGS AFSAHTGEYG AEWYFLDPEN RPANNPGITV DFRAFLLATG GLDAAEKHFR TCMDAFGIRY ADHLLRVSRV DYAIDFLAPW FEPDREALVV PPGTRVQEHT GIDETETHAT GARVTGLRAG AVANRQLVIY DKRQEVMQKG KLGWLTIWND ARAQLNRPPL DLTDRMTSQV WRFELRMGSK QLRNRWEMRS WQDLRDMVGD AYAEFCEKIR YTCPTTDSNR ARWPTHDLWR EVASVIANDL YENCSGVLPS EVIETNRAEH MRMLDRQILG LLVSRAAASE VQPHEFAEFL DTHIEAIERM SEEHATPLAE RIRKATERYR FK
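The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the 10-character grouping, translates with the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs in the start codons it permits), and computes the GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
prefix = clean("ATGGAAGCGG ATATCCTGCA TTCGGGCTTT")
print(translate(prefix))  # MEADILHSGF
```

Passing the full Gene sequence field through `clean` and `translate` should give a string to compare with the Protein sequence field, and `gc_fraction` gives a value to compare with the GC content row.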