Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2033 |
Symbol | |
ID | 5705687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2327112 |
End bp | 2328248 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271523 |
Product | hypothetical protein |
Protein accession | YP_001536894 |
Protein GI | 159037641 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0492847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0634839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCC TGTTCGTCTC CTCCCCCGGT ATCGGTCACC TGTTCCCCCT GGTCCAGCTC GCCTGGAGCT TCCGCACGGC TGGCCACGAC GTGGTCGTCG CGCTGGCCGA ACACACCCAG AAGGCCGCCG CCGCCGGTCT GGAGGTCGTG GACGTGGCCC CGGACTACAG CGCGGTCAAG GTCTTCGAGC AGGTGGCCAA GGACAACCCG CGGTTCGCCG AGACGGTCGC CACCCGCCCC GCCATCGACC TGGAGGAGTG GGGTGTGCAG ATCGCCGCAG TCAACCGGCC GTTGGTGGAC CGCACCATCG CCCTCGCCGA CGACTTCACG CCCGACCTGG TCGTCTACGA GCAGGGCGCT ACCGTCGGGC TGCTCGCCGC CGCGCGTGCC GGAGTACCCG CCATCCAGCG CAACCAGAGC GCCTGGCGCA CCCGGGGCAT GCACACCTCG ATCGCCTCCT TCCTCACCGA CCTGATGGAG AAGCACCAGG TCACCCTGCC CAAGCCGAGC GTGATGATCG AGTCGTTCCC GCCGAGCCTG CTGCTGGAGG CAGAGCCGGA GGGCTGGTTC ATGCGTTGGG TGCCGTACGG CGGTGGGGCG GTCCTCGGCG ACCGGCTGCC GGCGTCCCCA CCCCGCCCGG AGGTGGCCAT CACGATGGGC ACCATCGAAC TCCAGGCGTT CGGTATCGGC GCGGTGGCGC CCGTCATCGC CGCCGCCGCC GAGGTGGACG CCGACTTCGT ACTGGCGCTC GGCGACCTCG ACACCACACC GTTGGGCAAG CTGCCGCCGA ACATACGTGC GGTCGGCTGG ACCCCGCTGC ACACGCTGCT GCGGACCTGC ACCGCCGTGG TGCACCACGG CGGTGGCGGC ACGGTGATGA CCGCGATCGA CGCGGGTCTG CCGCAGTTAC TCGCCCCCGA CCCCCGCGAC CAGTTCCAGC ACACCGCCCG GCAGGCGGTC AGCCGACGCG GCATCGGCGT GGTGAGCACC GCCGACAAGG TCGACGCTGA CCTGCTGCGA CGGCTCATCG GGGACGAGTC GATGCGCGCG GCAGTGCGGG AGGTTCGCGA GGAGATGCGG GCGCTGCCCA CGCCGGCAGA GACGGTACGG CGTCTCGTGG AGTATGTCGC CGACTGA
|
Protein sequence | MRVLFVSSPG IGHLFPLVQL AWSFRTAGHD VVVALAEHTQ KAAAAGLEVV DVAPDYSAVK VFEQVAKDNP RFAETVATRP AIDLEEWGVQ IAAVNRPLVD RTIALADDFT PDLVVYEQGA TVGLLAAARA GVPAIQRNQS AWRTRGMHTS IASFLTDLME KHQVTLPKPS VMIESFPPSL LLEAEPEGWF MRWVPYGGGA VLGDRLPASP PRPEVAITMG TIELQAFGIG AVAPVIAAAA EVDADFVLAL GDLDTTPLGK LPPNIRAVGW TPLHTLLRTC TAVVHHGGGG TVMTAIDAGL PQLLAPDPRD QFQHTARQAV SRRGIGVVST ADKVDADLLR RLIGDESMRA AVREVREEMR ALPTPAETVR RLVEYVAD
|
| |