Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2036 |
Symbol | |
ID | 5705690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2330656 |
End bp | 2331849 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271526 |
Product | glycosyl transferase family protein |
Protein accession | YP_001536897 |
Protein GI | 159037644 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000746123 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCCATC TGCTGTTCGT CAACGTGGCC AGCCACGGTC TGGTCCTGCC CACCCTGGCG GTGGTCACCG AGCTGGTCCG GCGCGGACAC CGGGTCAGCT ACGTCACCGC GGGCGGGTTC GCCGTGCCGG TCGCGGCGGC CGGCGCGACC GTGGTGCCGT ACGCCTCGGA GATCATCGAC GCTGACGCCG CCGAGGTGTT CGGTGCCAAC GACCTCGGTG TCCGACCGCA CCTGATGTAT CTGCGAGAGA ACATGTCGGT GCTGCGGGCC ACCGCTGCCG CGCTCGACGA CGACGTCCCG GACCTGGTTC TCTACGATGA CTTCCCGTTC ATCGCCGGGC AGCTGCTGGC CGCCCGCTGG GATCGACCGG CCGGCCGGCT CAGCGCCGCC TTCGCCTCCA ACGAGCACTA CTCCTTTTCC CAGGACATGA TCGGGTTGGC CGGGACGATC GACCCGCTGG ACCTCCCGGC GTTCCGGGAC AACCTGGCGG CGTTGCTCGC CGAGCACGGT CTGACCCGGT CGGTGGTCGC GTGCTGGCAG CACGTCGAGC AGTTCAACCT GGTGTTCGTG CCGAAGGCGT TCCAGATCGC CGGGGAGTGC TTCGACGAGC GGTTCGAGTT CGTGGGGCCC TGTTTCGGGC AACGACGCTA TCTCGGGCGG TGGACACCCC CCACCGACGA CCGACCGGTC GTGCTGGTGT CGCTGGGCAC CACCTTCAAC GACCGGCCGG GGTTCTTCCG TGACTGCGCC CGCGCGTTCG CCGATCAGCC CTGGCACGTG GTGATGACCC TCGGCGACCA GGTCGATCCG GCGCAGCTCG GTGAGTTGCC ACCGAATGTG GAGGCGCACC CGTGGGTGCC GCACGTGGAG GTGCTGGAGC GGGCGAGGGT CTGCGTGACG CACGGTGGCA TGGGCACCCT GATGGAGGCG CTGCACTGGG GGCGTCCACT GGTTGTCGTG CCGCAGTCCT TCGACGTGCA GCCGATGGCC CGCCGGATCG ACCAGCTCGG TCTCGGTGTG CTTCTGCCCG GGGCGAAGGC CGACGGGCAG GAGCTGCTCG CCGCTGTCGA GCGGGTGGCC GGCGACCCGG CGCTGGCGCA GCGGGTGGCG GCGATGCGGG AGCAGGTGCG GCGGGCCGGC GGCGCGTACC GCGCCGCCGG TGCGATCGAG GCGTATCTGT CCCGGCGCCG GTGA
|
Protein sequence | MAHLLFVNVA SHGLVLPTLA VVTELVRRGH RVSYVTAGGF AVPVAAAGAT VVPYASEIID ADAAEVFGAN DLGVRPHLMY LRENMSVLRA TAAALDDDVP DLVLYDDFPF IAGQLLAARW DRPAGRLSAA FASNEHYSFS QDMIGLAGTI DPLDLPAFRD NLAALLAEHG LTRSVVACWQ HVEQFNLVFV PKAFQIAGEC FDERFEFVGP CFGQRRYLGR WTPPTDDRPV VLVSLGTTFN DRPGFFRDCA RAFADQPWHV VMTLGDQVDP AQLGELPPNV EAHPWVPHVE VLERARVCVT HGGMGTLMEA LHWGRPLVVV PQSFDVQPMA RRIDQLGLGV LLPGAKADGQ ELLAAVERVA GDPALAQRVA AMREQVRRAG GAYRAAGAIE AYLSRRR
|
| |