Gene Information |
Locus tag | TM1040_0548 |
Symbol | |
ID | 4077195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 582746 |
End bp | 585754 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005845 |
Product | glycosyl transferase family protein |
Protein accession | YP_612543 |
Protein GI | 99080389 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
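The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 582746
end_bp = 585754

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 3009 bp
print(protein_length, "aa")  # 1002 aa
```

Both results agree with the Gene Length and Protein Length fields reported in the record.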
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCCTGA ATGAGAGCTG CAATCAACTG GGGGTTCTGG TGCCGGACTG GCTCCGGCTC GAAACCACCC CCGAAGGGCC CGTGGTGAAG ATCGAATCCG AAGAAAGCCG CGCGCCTCTG GTGGATTACC GCAGCACATC GGGCCATGCC CCAGATCTGA TGCCAGTGCT TGAAGTGGAC ACTGGCAGCG ACAAGGCAGG TTTCTTGCGC AACCTCAGCA CGCCCGAGAC TGCTCAAAGC GTAACGTCTC AGGTGCTTGC ACAGCTTCGA CCGCTCTATG CCCAAGGAAC CTGCATCAGC ATTCCTGGGC TTGAGACCGC TGATCTCGAT GTGTTGCAGC CGTTTTTCAA AACCCTGACC TCGAGCCTGA GAAGCGAAGG CCATAGCCCC TGCCTCATCC TGAGCGGCAC CTCGACGGCG TGGCAATCGC GCGAAACCAC AGCGCTGTTT GATAAGGTCA TCCTCAAGCT CTTTCTCGAT CCCTGGGTGG GCACCGCGCC TTCGCCGCTG GCCACGGACG CCTGGTTCGA GAAAACCGCC AAGGCCGCAC TTCAGGAGAT TGGCAAAGAC AAACTGGTGA TCGCTCTTGG AACCTTCGCG GTGGAGTGGG TGTCGGGTGA GCCGCTGCCC AAGGTTCTGC CCTATGCGGC GGCAATGGAG AAAATTGCAG CCGCAGGCGC CGAGCTGCGC TTCAGCGAAA AGACCTCCGG ATCGCTTGCG TCTTATCGCG ATCCCGAAGG GCGCCTCAAC AAGATCTGGA TGCAGGACGT CGCCAGCCTC ATCAACCAGC TTGTCATCCT GCAGCAGCTT GAGATCCCCA ATAGCGCTGT CTGGTCGCTT GGCCTCGAGG ATCCGGGTAT CTGGAGCGTG CTGCAAAACC GTGACCTGAG CCACGACGCG CTGAGTGCAG ACTTGGCTCT GGTGAAGCTC GACTCATATG TGAGCTATCG CGGCGAAGGC GCGTTGCTGC GCCTTCACCG CCGGCAGTCT CCCGGCATCC GCCAGATCGG GTTTGATACG GAAACCGGGC GTGTAGTCTC GCAGAGCTAT GACCTGCTGC CGCGCCCCTA CGCGCTGGAG CGCTATGGAA AACCGGCCGG CCGCAAGGTG GTTTTGACCT TCGACGATGG CCCACATCCG GTGTTCTCCG AGCAGATCCT CGACATCCTG CAGGAAACCC AGACGCCCGC GACCTTCTTT GTGACCGGCA AAAGCGTGAT GAACGCGCCC GAGGTTTTGA ACCGGATGAT CGACGAGGGA CATGAAATCG GCGCACATAC GTTCTCTCAC CCCCGGATGG ATCAGGTCTC CAAGACCCGC GCGACGCTTG AATACGCGAT GCTCGACAAG GTGGTGGCCG GGGCGGCAGG TCGGCAGTTG ACCCTCTATC GTGAACCTTT CCAGCGCAGT GGCGGCCCGG TGACGGCCGA TCGCGTCGCC GCGCTCGAGA TTGCTTGGGA TCGCGACATG CAAGTGGTCG GCATGGATGT GGTACCCCAC GACTGGGCCG GATGGAGCGG CCGCGAGATT GCGGACTTTG CTATCGAGGA AGTCGAACGC GGTGCAGGCA ACGTGATCTT GCTGCATGAC GGCGGCGAGG ATCGCACCGC TTCGGTCGAG GCCACGCGGC TGATCATCAC CGAGCTCTCT GCCAAAGGCT ATGAGTTTAC CACCGTGGCC GACTTGACCG GCAGCACCCG TGCAGCATTG ATGCCCGTGA CCGAGGGCGG TTATCAAACC TTTGACCGTG TTTCCTTTTC TCTCGTCGCT TGGGGTCAGG ACGCCATCGT GATCCTGTTC 
TGGCTGGCGC TCGGCATCGG GGTTGTGCGC TCTGTTGCGA TCTTGTTGCT CGCCGTCCTG AATTGGCGCG GACATCGCAC CATCTCGCTG ACCACCCCAA AGGTGGCGGT GATCATCCCG GCCCACAACG AGGAGAAGGT CATCCGCAGC TGCATCCAGA GCGTGCGGGC AAGCGACTAC AAGAACCTCG AAATCATCGT GGTCGACGAT GGCTCCAGCG ACAATACGCT GAACGAGATC TTTGCCTTTT CGCACATGCG CGAGGTCCGC CTGATCTCGC AGCCGAACCA GGGCAAATGG AGTGCGCTGA ACCGGGCGCT GATGAACACA TCCGCCGAGA TTGTGGTCTG TATCGATGCA GACACGCAGA TCGAGAAATC CGCCATTGGG CACATGGTCA GACATTTCGA CAACCCAAGG ATCGGTGCGG TCGCGGGCAA GATCATCGCG GGCAACAAGG TGAACCTTCT GACCCGACTG CAGGCGCTGG AATATACCAC CGCGCAGAAC GTTGAGCGCA AGGCCTTTGA TCTGATCAAC GGCATGCTGG TGGTGCCCGG CGCCCTCGGT GCATGGCGCG TGGCTGCGCT GCGCAAGGCG GGGCACTTCA GCGACGAGAC GATGACCGAA GATACCGACC TCACCATCGA GGTCAACCGT GCAGGATACC GGATCGCATA TGAGCCGCTC GCCCGCGGCT ACACCGAGGT ACCCGAGCGC ATTGGGCAGC TTTTGAAACA GCGCCTGCGC TGGTCGTTTG GCATGTTCCA AAGCGCATGG AAGCACAAAA AAGCGATGTT CGAGGGGCGC TCTGTGGGGT TGATTTCGAT CCCTGACATG TTCATCTTTG GCTATCTCTT CCCACTGCTG GCGCCGATTG CGGACCTCTT TGTCGCCATC CTGCTTTACC AGATGGTCAG CGGCGGTTGG GACAGCGGGG CGGTTGGCGC GCAGAACATG CAGTATCTCC TCGCCTACCT CACCCTACCC GCGCTCGAGT TCGTGATTGC CGCCTTTGCC CTCGCACGGG ACAAGGATGA GAGCATGTGG TCGCTGTTGC TGTTCCCGGT CCAGCGGGTT CTCTACCGGC CGATCCTCTA TTACTCCGTG ATCCGTGCGA TCCTGCGGGC CATCACGGGC CGCCTGTTCA GCTGGGGTGC GCAGAAACGG CTGGGGCGTG ACTACAGCCT TGCGACGAGC GGCACATGA
|
Protein sequence | MSLNESCNQL GVLVPDWLRL ETTPEGPVVK IESEESRAPL VDYRSTSGHA PDLMPVLEVD TGSDKAGFLR NLSTPETAQS VTSQVLAQLR PLYAQGTCIS IPGLETADLD VLQPFFKTLT SSLRSEGHSP CLILSGTSTA WQSRETTALF DKVILKLFLD PWVGTAPSPL ATDAWFEKTA KAALQEIGKD KLVIALGTFA VEWVSGEPLP KVLPYAAAME KIAAAGAELR FSEKTSGSLA SYRDPEGRLN KIWMQDVASL INQLVILQQL EIPNSAVWSL GLEDPGIWSV LQNRDLSHDA LSADLALVKL DSYVSYRGEG ALLRLHRRQS PGIRQIGFDT ETGRVVSQSY DLLPRPYALE RYGKPAGRKV VLTFDDGPHP VFSEQILDIL QETQTPATFF VTGKSVMNAP EVLNRMIDEG HEIGAHTFSH PRMDQVSKTR ATLEYAMLDK VVAGAAGRQL TLYREPFQRS GGPVTADRVA ALEIAWDRDM QVVGMDVVPH DWAGWSGREI ADFAIEEVER GAGNVILLHD GGEDRTASVE ATRLIITELS AKGYEFTTVA DLTGSTRAAL MPVTEGGYQT FDRVSFSLVA WGQDAIVILF WLALGIGVVR SVAILLLAVL NWRGHRTISL TTPKVAVIIP AHNEEKVIRS CIQSVRASDY KNLEIIVVDD GSSDNTLNEI FAFSHMREVR LISQPNQGKW SALNRALMNT SAEIVVCIDA DTQIEKSAIG HMVRHFDNPR IGAVAGKIIA GNKVNLLTRL QALEYTTAQN VERKAFDLIN GMLVVPGALG AWRVAALRKA GHFSDETMTE DTDLTIEVNR AGYRIAYEPL ARGYTEVPER IGQLLKQRLR WSFGMFQSAW KHKKAMFEGR SVGLISIPDM FIFGYLFPLL APIADLFVAI LLYQMVSGGW DSGAVGAQNM QYLLAYLTLP ALEFVIAAFA LARDKDESMW SLLLFPVQRV LYRPILYYSV IRAILRAITG RLFSWGAQKR LGRDYSLATS GT
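The gene sequence starts with TTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the database's own pipeline; `translate_cds` and the constant names are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5'->3' on the coding strand.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    A permitted start codon in first position is read as M; translation
    stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTCCCTGA ATGAGAGCTG CAATCAACTG"))  # MSLNESCNQL
```

The output matches the first ten residues of the protein sequence in the record. Note that the internal CTG codons are still read as leucine; only the initiating codon is treated specially.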