Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0802 |
Symbol | |
ID | 4076074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 849590 |
End bp | 850969 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006100 |
Product | hypothetical protein |
Protein accession | YP_612797 |
Protein GI | 99080643 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000418176 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGACCCAAT CCGTCGCACA ACGCACCCCC GCAGCCGAGG CGCTGGTCAA GAGCGCCGCT CTGGGCCGAA TCCTGATGGG TGGCACCAAG GCCATGCGTG GAGCGGGTCA AAAGCACCTG CCGAAGTTTC CAGCAGAGAG CCACGAGGCA TACGAGGCCC GCCTCGCGTC GTCGTTCCTG TTCAACGGCT TCAAAAAGAC CGTGCGCGAT ATGACCGGCC GGGTATTCAC CAAGTCGGTT GAGATCGAGA GCGACACGCT CAAAGAGGAA GAGCAGAACA TCGACATGCA GGGCCGCGAC CTTTCGGCCT TTGCGCGTGA CGTGTTTGAG GCAGGCTTGA GCGGCTGCGG ACTGTCTTTC ATCATGGTTG ATGCCCCGCC GCGCCAGGGT GACGTGACCA AGGCACAGGC TCAAGCGTCA AACCTTCGGC CCTATCTCGT GCATGTGAAG GTAGAGGAAG TGCTGGGCTG GAAAACTGCC ACCGAGGGCA GCCGCACGTT TCTGAGCCAA TGGCGCATGA TGGAAACGCT GGAGCTTGAT GACCCGGAGG ACGAGTTCTC CGTAGCGTCC ATGAAGCAAG TGCGCGTGCT GACGTTGGTA GAGGGCCGGG TGAACGTCCG CCTCTACCGC GAGGCCGACA AGGGCGCAAA CTGGGAGCTA CACGCAGAGT TCGACACTCA GGCCACCGAG ATCACCGTTG TGCCGTTCTA CGCCAACCGC ACAGGCTTCT TCGCCGCCGA GCCTCCTCTA GAGGATCTTG CAGACAAGAA CGTCGAGCAT TGGCAGAGCG CCAGCGATCA GCGCAACATC TTGCATTTCG CCCGGGTGCC GATCCTCTTC GCATCTGGTC GAGACGATGA AAGTCCGCTG ACCTTCAGCG CGAGTGCGGC GACGACAGCG CGCGACCCGA ACGCCAAGCT GGAATGGGTC GAGCATTCTG GCGCAGCAAT CGGGGCAGGG CGTGACGACC TCAAAGATCT GGAATTTCAG ATGCAGGTAC TCGGCTTGCA ACTTTTGGTG GCCGGTACGG AAACGGCGAC GGGGGCCTCT CTTGACGCGG CAAAGGAGAC AGCGCCGCTC GCAATGATGG CCGACAACCT CAAGGACAGC CTCGAACAGG CGCTGCGCTG GTTTACGATG TACCAGGGCA GCGAGAACGC TGTGACGGTT AAGGTCAACA AGGACTTCGG CGTCTCGATG CTGACCGCGC AGGAACTGAC GGTGATGCTG ACCGCTGTGA ATACCGGCAA CATGCCCCGC CGGGTGTTCG TTGAGGAAAT GAAGCGCCGC GGCTTCATCG CCGAGGACAC CGATACGGAC GCATATCTGG GCGATCTGGA CGACGAAACG CCGCCGGGGC TCACCGATGG CGGTGAATGA
|
Protein sequence | MTQSVAQRTP AAEALVKSAA LGRILMGGTK AMRGAGQKHL PKFPAESHEA YEARLASSFL FNGFKKTVRD MTGRVFTKSV EIESDTLKEE EQNIDMQGRD LSAFARDVFE AGLSGCGLSF IMVDAPPRQG DVTKAQAQAS NLRPYLVHVK VEEVLGWKTA TEGSRTFLSQ WRMMETLELD DPEDEFSVAS MKQVRVLTLV EGRVNVRLYR EADKGANWEL HAEFDTQATE ITVVPFYANR TGFFAAEPPL EDLADKNVEH WQSASDQRNI LHFARVPILF ASGRDDESPL TFSASAATTA RDPNAKLEWV EHSGAAIGAG RDDLKDLEFQ MQVLGLQLLV AGTETATGAS LDAAKETAPL AMMADNLKDS LEQALRWFTM YQGSENAVTV KVNKDFGVSM LTAQELTVML TAVNTGNMPR RVFVEEMKRR GFIAEDTDTD AYLGDLDDET PPGLTDGGE
|
| |