Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2837 |
Symbol | |
ID | 4076656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 3004905 |
End bp | 3006122 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638008166 |
Product | hypothetical protein |
Protein accession | YP_614831 |
Protein GI | 99082677 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.839793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGCAA CTGAACACCC GGCACCGACC ACGCCTCCCT CGCCGGAGGG CGGCTGGTCG GAAAGCATCT CGGTGCTGCG CAATGTGACG GTGGTGCCTC CGGTCGAGAG CAACCTCGTG CAGGCCGCCG GTCTTTTGCG CGAGGACGGC AGCTATTGTG CTGAGGGTGC CTTGTGGCGC AGGCATCGAC CCATCACGAC AGAACCTCCA AAACCCTCGG AGTTTGCGGA AAAAATCTCT GGACGTTGGC TGTGGGGTGG CGTGCTATGG GCGCATTTCG GGCACTTTCT GGTCGAAAGC ACCGCCCGTC TCTGGGCGCT GTCAGAACTG GATGCGCCGG TGGATGGGGT GTTGTTCATT CCAAAACGCC CCGCCGTCAG AGATCAGGTG CGCGGGTTTC AGGCCGAATT CGTCGATCTC ATGCAGAGGG ACCTGCCAAT CCGCGTTGCA GCGGATCCGT CTCTGGTTGA GGAACTTGTG ATCCCCGGGC AGGGGTTTGG CCTTGGGAGG ATTACCGAGG CAACGCCCAA GTACCGCAAC GCGATCCATG CCCGTTTTGC GCGCGACATC AAACCCGAGG GGCCGGAGAA GATCTACATC TCACGCTCCA AGCTGGGGCT CGGCAAGGGC GGGCTGTTGG GCGAAGAGCA GATGGAAGCC TTCCTCGCGG CGGAGGGCTA CGAGATTTTC CACCCACAGG AACATACCCT GTCGGAGCAG CTGGCGCGCT ACAAGGCGGC GCGCAAGGTG ATCGCGGCTG ATGGTTCCGC GCTGCATCTT TATGCAATGG TGGGGCGGCC CGATCAGAAG GTTGCGATGG TTCTGCGGCG CAAATCCACC GCGCATACGC TGTTGACCGA CAACGTACGT TACTTCTGCA AGTGCGACCC CTTGGTGATT GGTGCATTAC GCACGGAATG GGTGCCCAAG AACAATCAAC GCTCCAGCCG TCTGAGCTTT GGGGAACTGG ATCATTCTGT TATCGGCCGG GCGCTCCACG AGGCAGGCTT TATTTCGGGT GGGAAAAACT GGCCGGTGCT GGATGACGCC GCGCGCAATC AGGTGCTCAA AGACAAAGGC ATTAAAAGTG ATCGCTTTGT CGAGTCCCCC GCGTTTCGCA AGGCGCGCGA GGAAAAGGAA CGGGCGGAGC GTCGCGCCCG TCGCGCAGCA AGACACGCCC GCAGGCAGGC TCGCGCCGCT GCGCAAAACG ACGGCTAA
|
Protein sequence | MCATEHPAPT TPPSPEGGWS ESISVLRNVT VVPPVESNLV QAAGLLREDG SYCAEGALWR RHRPITTEPP KPSEFAEKIS GRWLWGGVLW AHFGHFLVES TARLWALSEL DAPVDGVLFI PKRPAVRDQV RGFQAEFVDL MQRDLPIRVA ADPSLVEELV IPGQGFGLGR ITEATPKYRN AIHARFARDI KPEGPEKIYI SRSKLGLGKG GLLGEEQMEA FLAAEGYEIF HPQEHTLSEQ LARYKAARKV IAADGSALHL YAMVGRPDQK VAMVLRRKST AHTLLTDNVR YFCKCDPLVI GALRTEWVPK NNQRSSRLSF GELDHSVIGR ALHEAGFISG GKNWPVLDDA ARNQVLKDKG IKSDRFVESP AFRKAREEKE RAERRARRAA RHARRQARAA AQNDG
|
| |