Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3474 |
Symbol | |
ID | 5708076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4006327 |
End bp | 4007454 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641272901 |
Product | glycosyl transferase family protein |
Protein accession | YP_001538267 |
Protein GI | 159039014 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.288784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0309659 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTGT TGCTGATGGT GCTGGCCGGC GTGGCCGCGC TGACCGCGCA CACCCTGGTC AACGCCGGTC GTTGGCTGCG CCGCCCGACT GGGACGCCAG CCGAGGTGAC CGAGCCGGTG GCGGTGCTGC TGCCGCTGCG CGACGAGGCC ACCCGGGTAA CCCCGTGCCT ACGCGCGCTG CTGGCTCAGC GTGGTGTGCC GGAGCTACGG ATCGTGGTGC TCGACGACGG GTCGACCGAC GGCACCCGCG AGGTCGTCCA CGCGGTCGTC GGGGACGACC CACGCGTCAC CCTGCTCGAC GGGGGCGCCC CACCCCCCGG CTGGCTGGGC AAACCACACG CCTGCTGGCA GCTCGCCACC CGGGCCGACC CGGCCGCCAC CGTGCTGGTC TTCGTGGACG CCGACGTCGT GCTCGCCCCA CACGCCGTGG CCGCGGCCGT CGGCGAACTG CGCGCCGCGC GGGTGGCACT GCTGTCGCCG TACCCCCGAA TCCTGGTCAC GACGGTGGCC GACCGGCTGG TCCAGCCGCT GTTGCAGTGG TTGTGGCTGA CGTTCCTGCC GCTGCGCGCG ATGGAACGGT CGGCCCGACC GTCTCTGGCC GCGGCCGGTG GGCAGTTCCT GGTCGTGGAC CGGATCGGTT ACACCGCCGC GGGCGGGCAC GCGGCGGTGG CCGACCGGAT CCTGGAGGAC GTCGAACTGG CCCGGGCGGT CAAACGGTCC GGCGGCCAGA TAGCTCTCGC CGACGGCTCA CGGCTGGCCA CCTGCCAGAT GTACGACGAC TGGCCGCAGT TGCGGGACGG CTACTCCAAG TCGCTGTGGG CCTCGTTTGG CCGTCCTACA GCGGCTGCCA CGGCGGTAGC GGTGCTCCTG CTGCTCTACA CCGCCCCACC GCTGGTCGCT GTGGTCACGT GGGCCAGTGG TGCACCGGGG ACGGCCACCG TCGCCGCCGG GACGTACCTG CTCGGGGTCG CCGGACGGGT GGTCAGCGCC CGAGCGACCG GCGGCCGGTG GTGGCCTGAC GCGTTGGGCC ACCCCGCGTC GGTCGCGGTC CTCGGTTGGC TGACCCTGCG GTCGTACCAT CTGCGGAAGC AACGGCGCCT GAGCTGGCGG GGCCGTCCGG TCGTCTAG
|
Protein sequence | MILLLMVLAG VAALTAHTLV NAGRWLRRPT GTPAEVTEPV AVLLPLRDEA TRVTPCLRAL LAQRGVPELR IVVLDDGSTD GTREVVHAVV GDDPRVTLLD GGAPPPGWLG KPHACWQLAT RADPAATVLV FVDADVVLAP HAVAAAVGEL RAARVALLSP YPRILVTTVA DRLVQPLLQW LWLTFLPLRA MERSARPSLA AAGGQFLVVD RIGYTAAGGH AAVADRILED VELARAVKRS GGQIALADGS RLATCQMYDD WPQLRDGYSK SLWASFGRPT AAATAVAVLL LLYTAPPLVA VVTWASGAPG TATVAAGTYL LGVAGRVVSA RATGGRWWPD ALGHPASVAV LGWLTLRSYH LRKQRRLSWR GRPVV
|
| |