Gene Information |
Locus tag | TM1040_0228 |
Symbol | |
ID | 4076261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 242545 |
End bp | 243768 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005522 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_612223 |
Protein GI | 99080069 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
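The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. These are common conventions for this kind of record, but the record itself does not state them. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(242545, 243768)
print(length_bp)                  # 1224
print(protein_length(length_bp))  # 407
```

Under these assumptions both results agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields.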
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.219003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGC TCGTCGTATC GACGAATGCT GCCCTCACCA TGGGGGGCGA GGCGATGAAG GCGCTGCAGT ATATGCAGCA GCTCTTGGCG GATGGACGCG ATGCCACTCT CATCACCCAT GAACGCTGTC GCGAGGCTCT TGCGGGGCAA TTGCCAGAAG ATCGGGTGAT CTATGTGCAT GACAGCCGTG CAATGAAGGC CTGTTGGCGC ACGCCGGGGC TTGGGCGGTT GGTGAACAGT TTTTTTCACC TCGAGGTCGC CCGGATCTGT CGCGGCTTTA ACCCGAGCGA GGTGGTGATC CACTATCTTT GTCCGATCTC CCCCGTCGAG CAGCGCTTCC CGCCGAAGGG GTATCGCTAT GTCATCGGCC CGCTTTCAGG CAATATCTTC TACCCAGAGG GGTTTCGACA TCTTGCGGGG CGGGGGCTGC GCCTGCAGCA TCAGGCGTAT CGGCCTTTGC AGATGGCGCT TGGCCTCTTG TCTAGGCAAT TCACGCGCGC CTCGACCGTG TTGGTCTCTG GCTATGACCG TACCCGAGAG GCCCTCGGCT GGGCGGGTTG CCCGGAGGCC CGCATGCAGG ACGTCTGGGA TGCGGGCCTG TCTCCAGATT TCTTTGCGCG TTCCCGGATC CGGCCGGGCA AGAACCCGGC GCATTTTGTG TGGATTGGAC GTATGGTGCC CTACAAGGGG GCGGATCTTG CGTTGCGCGC GCTGGCGCTT GCCCCGGCAG AGGCACGGCT CACGCTCTAT GGAGATGGGC CGGATCGTGC CGAACTGGAG GCGCTCGCCC GCGATCTTGG CCTGATGTCG CGGGTCACCT TTGCGGGCTG GCTTGCGCAT GGGGATCTCT CCGAGGCGTT GGGCCAGTAC CGAGCACTTT TGTTCCCGAG CCTCAAAGAA GCCAACGGCA TCATCGTGCA GGAATGTATG GCGATCGGCT TGCCGGTCGT GGCCTTGCGC TGGGGCGGGC CTGTGGGGCT CGCGGATGAC ACTGAGGCGC TGTTTGTCGA GGCGCAGAAT GCCGTACAGG TCGAGCAGGA CTTGGCTGCG GCCATGGCGC GTCTGACAGA AGACCCAGCC CTTGCGGAGG CGCTCTCTGA TGCGGCGCGA CGCAAGGCTG AAAACGAGTT CCCCTGGCCG CAGGTGGCCC AAAGCTGGTG CAGCGCAGCG CTTCGCGCGC AGGATGCGGC TGCAGCAGAG CCAAAGCACC GCGGCGGGGG TTGA
|
Protein sequence | MKLLVVSTNA ALTMGGEAMK ALQYMQQLLA DGRDATLITH ERCREALAGQ LPEDRVIYVH DSRAMKACWR TPGLGRLVNS FFHLEVARIC RGFNPSEVVI HYLCPISPVE QRFPPKGYRY VIGPLSGNIF YPEGFRHLAG RGLRLQHQAY RPLQMALGLL SRQFTRASTV LVSGYDRTRE ALGWAGCPEA RMQDVWDAGL SPDFFARSRI RPGKNPAHFV WIGRMVPYKG ADLALRALAL APAEARLTLY GDGPDRAELE ALARDLGLMS RVTFAGWLAH GDLSEALGQY RALLFPSLKE ANGIIVQECM AIGLPVVALR WGGPVGLADD TEALFVEAQN AVQVEQDLAA AMARLTEDPA LAEALSDAAR RKAENEFPWP QVAQSWCSAA LRAQDAAAAE PKHRGGG
|
| |
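The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that would underlie the GC content field. The helper names are hypothetical, and only the first two blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAAATTGC TCGTCGTATC"
seq = clean_sequence(first_blocks)
print(seq)               # ATGAAATTGCTCGTCGTATC
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content reported in the record.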