Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1687 |
Symbol | |
ID | 5705222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1946346 |
End bp | 1947905 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641271190 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001536565 |
Protein GI | 159037312 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00440693 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCGA CGGGGAAAGG CACATCGACG GCGCCTGCCC GGATCGTGAT ACTCGTCGAC AATGGTGTCA CTGGCGACTC CCGGGTGCAG AAGACAGCAC GTTCAGCCGC TGACGCCGGG TGGGATGTGA CTCTGCTCGG CAGCTCACCC AACGGGCGGC CCCAGCAGTG GAGGCTCGGC TCCGCTACGG TGCGGCTGCT GCCGATGCCG AGCCCGTTGA GGCAACGTCG GCATGAGCTG CGTCGGCGAT GGTTGCTCGG CCCCCTGGCC TATCCCCCGA CCGGTGTCGC CGCGCGGCGG CAGCAGGAGG TTCGTGCCTG GCAGGCCGAT CTGAGGGTCC GTCGGGCGCT GTTGACCTCC GAGGGGGGCT CCTCTCTGGC TCGCCAATGG CTGCGAGCGC AGGCGCTGGC CGCTCGAGTA GTGCGGAAGT GGATCTCTTT CCGACATTGG CAGTTGACCA ATGGACAAAA AAAGCGTAAG CGGTTGACGA CCCCGTCGGA CCGCCTCTTC ACGTGGTTGC AGCTGCGTCT GCGGGGCGAC CGTGCCTGGC GCCGGCTCGA ACCACAGCTG TGGGACTTTG AGCTTGCCTT CGCTCAGGTT GTTGATCAGC TCAAGCCGGA CATCATCTAT GCCAACGACT TCCGTATGTT GGGTGTCGGC GCACGTGCCA AGATCAGGGC GGCAGCGGCT GGCCGTGAGA TTAAGTTGAT CTGGGATGTT CACGAGTATC TTCCTGGCGT GAAGCCACGA GTGGACAACA ACCGGTGGAT GGTTGCCAAT CAGGCACACG AACGCGAGTA CGCCAGGTGG GCCGATGCGG TGATGACGGT ATCTGACCGA TTGGCTGAGC TGTTACAACG TGATCATGGA TTGGCCGAGC GGCCGTCGAT CGTACTCAAC ACGCCGAACG CGGCTGATGC GTTAGGCGCT CACGGCGCTG ATTCCCAGGA TGTGCGCAGT AAATGTGGAC TCGATCCCGA TGACCCGCTC GTGGTCTACA GTGGGGCGGC GGCAGCGCAC CGCGGCATGG GTGTGATGGT GGAAGCGTTG CCTCGCCTGT CCGACGCGCA CGTGGCATTC GTCGTCAATG CCCCAGCCGG GCCCTACATG AAAAGCCTGG TGGCCCGAGC CCGTGAACTC GGCGTGGCGG ATCGTGTGCA TGTGCTGCCG TACGTCGCGC CGGCGGAGGT GGTTGGTTTC CTGTCCACCG CGACGTTGGG CGTGATCCCG ATTCACCATT GGCTCAATCA TGAGATCCAA CTCATCACCA AGTTCTTCGA GTATTCCCAC GCACGGCTGC CGATTGTGGT CAGTGACGTC GAGACCATGG CGGCCGCTGT ACAGGAAAGC GGGCAGGGTG AAGTCTTCCA GGTTGACGAT GTGGATGGAT TTGTGATGGC GGTCGAGACG ATCCTGGCGG ATCCCCAGCG GTATCGCAAA GTATATGACG CGATGGATCT GAGGGTGTGG ACTTGGGAGG AGCAGGCGCG GGTCCAGAAC AGCATCTACC AGCGATTGGC CCCGCAGGAC CGACACCCTG CCCTGTCGGC GTCCGATTGA
|
Protein sequence | MTATGKGTST APARIVILVD NGVTGDSRVQ KTARSAADAG WDVTLLGSSP NGRPQQWRLG SATVRLLPMP SPLRQRRHEL RRRWLLGPLA YPPTGVAARR QQEVRAWQAD LRVRRALLTS EGGSSLARQW LRAQALAARV VRKWISFRHW QLTNGQKKRK RLTTPSDRLF TWLQLRLRGD RAWRRLEPQL WDFELAFAQV VDQLKPDIIY ANDFRMLGVG ARAKIRAAAA GREIKLIWDV HEYLPGVKPR VDNNRWMVAN QAHEREYARW ADAVMTVSDR LAELLQRDHG LAERPSIVLN TPNAADALGA HGADSQDVRS KCGLDPDDPL VVYSGAAAAH RGMGVMVEAL PRLSDAHVAF VVNAPAGPYM KSLVARAREL GVADRVHVLP YVAPAEVVGF LSTATLGVIP IHHWLNHEIQ LITKFFEYSH ARLPIVVSDV ETMAAAVQES GQGEVFQVDD VDGFVMAVET ILADPQRYRK VYDAMDLRVW TWEEQARVQN SIYQRLAPQD RHPALSASD
|
| |