Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3832 |
Symbol | |
ID | 4074982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 76954 |
End bp | 78234 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638004490 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_611225 |
Protein GI | 99077966 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.175097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.13882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGCC CAAACACTAT GGTAAAGCCG CAGATTGTTT TTGACTTAAC GGAAGTCCTT TTGGCTTCGA CTGGTAAACT TAGGTATTAT GGCATTGCAC GCGTAGCTGC AGAAGCTGGT ATCGCGCTTC GGAAGCTGGA CGCGTCCATT CGGTTCGCTG TCTTCTCGCA GGGCCACCGG GGACTACTTG AGGTCTTTCC GGAAATACGC GAAGACGGCA GTGTAGAGCT AAACGTTCCT GCGGGCATTC GGCAAATTCG TATGCGCAGC CATCACTATA GCAGGTCGAA GCTTAGAGAC CTTATTCTGT CAGCAATTCG ACCGCTGATC GACCGGAAAA ATCGGTTGTT TTGGGATGAG ATTGCGCCTG GAATGCCTCA AATAGAGATG GCGGGTAAGA CGTTTGTCAC CTGCTCCCGC CCCAAAGTGA TCACTGAAAT TTTGTGCGCT ATGGCGAAAC AAGGCGTGTC ATGTGACGTC ATCCCAATGT TGCATGACAT GATACCCCTG CATGATTTTC ATCATCAAAG GGCAAGCTTT CCGAAGAACT TTGTCGGCGA CAACAGGTTC GTCATCGAGA GGGCAAAAGG CCTCTTGTCA GTATCCGAGT TTACGCGCCA AGAAATTATA GATTTTTCGC AGAGCGATGT GTTGCCGGCA GTGCCCGAAA TCATTGCCGT CCCCCTGGTA CATCAGTGCC CGATAGGGAC AGAACCCGCA GAGCAAGCTC CCCCCGATAC TCCATATATT TTGACTGTCG GCTCAATGCT GGGCCGTAAG AACTTAGACG TTGTGTTCGA AGCTTTGCGT GTGTTACAGA GAACAGGTAG CCCATTGCCG AAACTGGTTT TGGCCGGTGC GCCTAGGGGG CGCACACGAA CTTATGTCGC AAGTGCGGAA TGTGATAGCA TTCGGGATCT GGTGCTCTTT TACGAGAACC CAAATCAAAC TGATCTCGTA ACCCTTTATG AAAACGCCAC AGCGGTCATT ATGCCAAGCC GCATGGAAGG GTGGGGTTTG CCAGCGGGAG AGGCTCTCTG GTGTGGCACA CCTGCTATAT GCTCTACGGC TCCTGTGCTT GAAGAAGTTT GCGGCGATTT AGGGTTGTAC TTTGATCCTG ACGCAGCAGA TGAACTGGCA GAATACATTC GCCGCCTACT CACGGATTCT GCGTTTTCGA CGAAGTTGCG CATGCGTATT TCAGAGCACA AGTCCAAATT GCGAACCTGG GATAATGTAG CCGAAGATAT CGTAGCTGCG GTATCTCGCC TCTCGCGCTA A
|
Protein sequence | MSSPNTMVKP QIVFDLTEVL LASTGKLRYY GIARVAAEAG IALRKLDASI RFAVFSQGHR GLLEVFPEIR EDGSVELNVP AGIRQIRMRS HHYSRSKLRD LILSAIRPLI DRKNRLFWDE IAPGMPQIEM AGKTFVTCSR PKVITEILCA MAKQGVSCDV IPMLHDMIPL HDFHHQRASF PKNFVGDNRF VIERAKGLLS VSEFTRQEII DFSQSDVLPA VPEIIAVPLV HQCPIGTEPA EQAPPDTPYI LTVGSMLGRK NLDVVFEALR VLQRTGSPLP KLVLAGAPRG RTRTYVASAE CDSIRDLVLF YENPNQTDLV TLYENATAVI MPSRMEGWGL PAGEALWCGT PAICSTAPVL EEVCGDLGLY FDPDAADELA EYIRRLLTDS AFSTKLRMRI SEHKSKLRTW DNVAEDIVAA VSRLSR
|
| |