Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3460 |
Symbol | |
ID | 4075094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 483277 |
End bp | 484920 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638004969 |
Product | hypothetical protein |
Protein accession | YP_611694 |
Protein GI | 99078436 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02231] conserved hypothetical protein |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTTG TTCTCTCTTC TGTCCTTGTT GGGGCTGCGG CCCTGTTGTC CACGACCGCA TTGGCGGAGA CCTTCACCGC CGCAAGCCGC GTCAGCGCGG TGACCGTCTA TCCCAGCGAG GCATTGATGA CCCGCACCGC CGAAGTGACG CTGCCCGCAG GCCGTCACCG GATCATCATC ACGGGAATGC CCTTTGTGGA TGAGGTGGAA ACCCTGCAGG TGACGCATCC CGGCGTGCGC CGGGTCGGGC TGTATCTGCG CGAGAGTTTT CCGGTTCTCG AGGAGACCGC GACCCCCGAA CAGGCCACGG CTGAGGCTCG GGTGGCGGAG ATTGAGGACC AGATAGATGC GCTGCGCCAG ACCGCAGAGG AGGCGCGACT GGCGGCGCAG GGCGCCGAGG CAGCGATCGC CTTTCTGAGT GCCCTTAACC GGGGCGAGGG AGCGGCACTG CCCGACCCCG ATCAGTTGCG CGCACTGGTG AGCACGGTGC GCGAGGAAAC CTCCGAGGCA CGCGAGACCA TTTTGCGGGC CAAAGCGCAG GAGCGCGGGT TTCAGCGCCA GATGAAGGAT CTCGAGGCCG AGCTGGCCCG GGCGCAGGCG GAGCTTCGGG CGCTGTCGCA GCAGGATGAA GAACAGATGT TCCTCGCACT GGATGTCGAG GCGGATGAAG AAGTGACGGT ACCGCTCAGC CTAAGCTACC CCAATGATGC GGTGTCATGG GGGCCGTCTT ATGCGTTCAA TCTGCAAACA GGCACCTCGC CAGAGGTCAC TTTGCGCCGG GATGTGATGA TCCAGCAGGC CACTGGCGAG GATTGGGTGG ATGTGGCGCT GCGGGTCTCG ACCTCGACCC CGGATCAGAG GATCAACCCA CATGACCTGC GTCCCAATCG GCGTTGGATC GAGGACGAGC AACAGGCCAA ACAGCGTTTC ACCTCAGAAA GCCGGGTGAT GAGCGCCGCC GAGCCGATCG TCGAGGCCCC GATCATCGTC GAGGACACAG GTGCCTCCTT TGGTGTCGTC GCCAGCGAGG CAGGGGTTTC TTACAGCTTT GATGTGCCTG TCAGCCTGCG CTCGGGGGCG GAGTTGGCCT ATCTCACCCT GCCGGATGTC ACGTTTGAGG CTGAGGTCTT TGCCCAGGCC GTGCCACGCC GCGATCCGAC CGCCTTTCGC ATTGCGCGCA TCGTGAACAC CAGTGGCGAG GAGCTGTTGG CAAGTTCGAC CACGCGCTAT TTTGTTGATG GCGACCTTGT TGGATCTGGG CATTTCGCGG GGCTCACGCC CGACGCCGAG CTCGATCTGG GTTTTGGCCC GATTGATGGC CTGCGCCTCA AACGGGACCT TCTGGATCAG AGCGAAGGTG GGCGCGGCGT GATCTCGCGC AGCAACCAGC GCGATATGGT GGCCGAGATC GAGGTGGAGA ACCTGACCGG TCAGGACTGG CCGCTGCGCG TTCTTGATCT GGTGCCCTTC AGCGATCAGG AGGATCTGGA GATCACATGG TCTGCAACGC CCGCGCCCTC CGAGGAAAAC GTCGACAAAC AGCGCGGAAT CCTCGCCTGG GATCTCGATG TGTCGGCAGG AAAGACGCGC ACCATCAGCC TCAAGACCCG CCTGAGCTGG CCTGAGGGCA TGGTCCTGCG CTGA
|
Protein sequence | MRFVLSSVLV GAAALLSTTA LAETFTAASR VSAVTVYPSE ALMTRTAEVT LPAGRHRIII TGMPFVDEVE TLQVTHPGVR RVGLYLRESF PVLEETATPE QATAEARVAE IEDQIDALRQ TAEEARLAAQ GAEAAIAFLS ALNRGEGAAL PDPDQLRALV STVREETSEA RETILRAKAQ ERGFQRQMKD LEAELARAQA ELRALSQQDE EQMFLALDVE ADEEVTVPLS LSYPNDAVSW GPSYAFNLQT GTSPEVTLRR DVMIQQATGE DWVDVALRVS TSTPDQRINP HDLRPNRRWI EDEQQAKQRF TSESRVMSAA EPIVEAPIIV EDTGASFGVV ASEAGVSYSF DVPVSLRSGA ELAYLTLPDV TFEAEVFAQA VPRRDPTAFR IARIVNTSGE ELLASSTTRY FVDGDLVGSG HFAGLTPDAE LDLGFGPIDG LRLKRDLLDQ SEGGRGVISR SNQRDMVAEI EVENLTGQDW PLRVLDLVPF SDQEDLEITW SATPAPSEEN VDKQRGILAW DLDVSAGKTR TISLKTRLSW PEGMVLR
|
| |