Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2328 |
Symbol | |
ID | 5704252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2676954 |
End bp | 2678255 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271806 |
Product | glycosyl transferase family protein |
Protein accession | YP_001537177 |
Protein GI | 159037924 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0735299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00124939 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTGATCG CTACCACACC GGCACCCGGA CACGTCGTCA GCATGGTGGA TGTGGCGGGC GAACTGACTC GGCGTGGACA TGAGGTGCGG TGGTACACCG GCCGTGCCTT TCAGGAGCAG GTCGAGCAGG CCGGGGCGCG CTTCGAGCCG ATGAGTGAGG CACTCGACTT CGGCGGCAGG AGTCGGGAGG AGGCGTTTCC CAGCCACGCC GGGCTGACCG GCCTCGCCAG TTTCAAGATC GGCGTGCGTG ACATCTTCTA CCACACGGCG CCCGGCCAGC TCGACGACCT ACTTCGCGTG CTCGAACGCT TTCCCGCCGA CTGCATCCTC GCCGATGACA TGTGTTATGG CGCGTGTTTC GCGAGCGAGC GAACCGGCCT GCCGATGGCC TGGCTGAGCA ATTCGATCTA CATCCTGGGC AGTAGGGATA CCGCACCGCT CGGGTTGGGG CTGCAACCGA GTTCGTCGCC GCTGGGACGG GCCCGTAACG CTCTGCTGCG ATTCCTCGGT GACCATGTGT CCATGCGAGA CCTACGCCGG GAGGCCGACC GCACGCGAGC CTCGGTGAAC CTGCCGCGGC TGAGGACACG GGCCATGGAG AACATCACGC GCCCCCCAGA TTTGTACCTG GTGAGCACCG TGCCGTCCTT CGAGTTCTCC CGCAGCGATC TCCTACCAGG CACACACTTC ATCGGCGGTC TCTTCGGACT TCCCCCGGAG CGGTTCGAGC CGCCCAGTTG GTGGCAGGAG TTGGATGGAG ACAAGCCGGT GGTGCTCATC ACCCAGGGCA CCACCGCCAA CGACGTCGAC CGGCTACTCG TTCCCGCAGT CCGGGCGCTG GCCCGCGAGG ACCTGCTCGT CGTGGTGACC ACGGGAAGCG ACCTGGATGT CGACCTGCTA CGGCCGCTAC CCGGCAACGT CCGGTTGGAG CGGTTCGTTT CCTATCACCA TCTGTTGCCC CGCGTGGACG TGATGCTGAC CAACGGCGGC TACAACGGTG TCAACGCCGC CCTCGCCCAT GGCGTACCCC TGGTCGCCGC TCCGGCGACC GAGGAGAATC CCGACGTCGC GGCCCGGATC GCGTGGTCCG GAGCGGGCGT CGTTCTCGCC CGGCGCGCGG TGTCAGAGGC CACCCTGCGT AACGCCGTAG TCACCGTTCT GCACGACGAG CGCTACCGGC AACGGGCACA CGTGCTGTCC CGCGAGCACC AGCGCTACGA CGCCCCGCGA CGAGCCGCCG AGCTCATCGA GGCCATGGCC GAATCCCAGG GCCGAGTCCC TACCGGAGGT CCCACCCAAT GA
|
Protein sequence | MLIATTPAPG HVVSMVDVAG ELTRRGHEVR WYTGRAFQEQ VEQAGARFEP MSEALDFGGR SREEAFPSHA GLTGLASFKI GVRDIFYHTA PGQLDDLLRV LERFPADCIL ADDMCYGACF ASERTGLPMA WLSNSIYILG SRDTAPLGLG LQPSSSPLGR ARNALLRFLG DHVSMRDLRR EADRTRASVN LPRLRTRAME NITRPPDLYL VSTVPSFEFS RSDLLPGTHF IGGLFGLPPE RFEPPSWWQE LDGDKPVVLI TQGTTANDVD RLLVPAVRAL AREDLLVVVT TGSDLDVDLL RPLPGNVRLE RFVSYHHLLP RVDVMLTNGG YNGVNAALAH GVPLVAAPAT EENPDVAARI AWSGAGVVLA RRAVSEATLR NAVVTVLHDE RYRQRAHVLS REHQRYDAPR RAAELIEAMA ESQGRVPTGG PTQ
|
| |