Gene Information |
Locus tag | Sare_1535 |
Symbol | |
ID | 5703516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1770670 |
End bp | 1771788 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271046 |
Product | hypothetical protein |
Protein accession | YP_001536422 |
Protein GI | 159037169 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
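The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1770670
end_bp = 1771788

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1119 0 372
```

Under these assumptions the result agrees with the listed Gene Length (1119 bp) and Protein Length (372 aa).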
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0104917 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTAACA ATCGGGGGAT CGTCGTCGCC GCCACCGGTG ACCTGCTCAT CGCACGCGAC CGCCCACACG ACATCTTCCG GTACGTGCGA GACCTGTTGA CCGAGGCCGA CATCACCTTC GGCCAGTTGG AGACCGCGTA CTCCGACCAG GGATCCCTGG GCTCGTCCGG ACCGCGCGGA GGCGTACCGC ACGACGTGGA GAACCTGGTG GCGATCCCGC ACGCGGGCTT CGACGTCATT TCGATGGCGA GCAACCACAC CGGCGACTGG GGGGCCGACG CGTTACTCGA CTGCATCGAG CGGTGCCGGC GCCACGGCAT CACCGTGGTG GGTGCGGGCG CGGACATCGC CGAGGCGCGC CGGCCGGGGA TCATCGAACG GGACGGGACC CGGGTCGGAT TCCTGGCCTA CTGCTCGGTC GCGCCGGATG GCTACTACGC CGGGCCGGGT AAGCACGGTG TGGCGCCGAT GCGGGCGAGA ACGCACTATG AACCGTTCGA GTACGACCAG CCCGGCGGCC CGCCCCTGGT CAGAACCTCG CCGGACGAAT CCGATCTGGC GGCGCTCGTC GCGGACGTCG ACGAGTTGCG CGACCAGGTG GACGTGCTGA TCGTGTCATT CCACTGGGGC CTGCATTTTC AGCCCGCACG GCTCGCGGAC TACCAGCCGG TGGTGGCGCA CGCGGCGATC GACGCCGGTG CCGACGCGGT GATCGGGCAC CACCCGCACA TCCTGAAGCC GGTGGAGGTC TACCGAGGCA AGGTCATCTT CTACAGCCTG GGCAACTTCG CCCTCGAGAT CAACGAGCGC TGGTGGCAGT CGTACAGCAA GGAATGGTTC GAGAAGGCGA ACGAGTTCCA CCAGGAACGT TCTCCCCACC GGGACCTGAA GGAGGAGGCC CGGAACTCGG CGATCGTGCG GCTGCACATC GTCGACGGTC GCATCGACCG GGTTGGGATC GTACCTGTGG TGATCAACGA GGCGCACGAG CCGGTACCGC ATCGGGCGGA CACAACGGAC GGGCGCGCGG TCCGCGCCTA CCTGGCGCAG ATCACGGCCG AGGTGGGGAT CGACACCACC TTCGACGTGG TCGACAACGA GGTCCTGGTC CGCGTCTGA
|
Protein sequence | MGNNRGIVVA ATGDLLIARD RPHDIFRYVR DLLTEADITF GQLETAYSDQ GSLGSSGPRG GVPHDVENLV AIPHAGFDVI SMASNHTGDW GADALLDCIE RCRRHGITVV GAGADIAEAR RPGIIERDGT RVGFLAYCSV APDGYYAGPG KHGVAPMRAR THYEPFEYDQ PGGPPLVRTS PDESDLAALV ADVDELRDQV DVLIVSFHWG LHFQPARLAD YQPVVAHAAI DAGADAVIGH HPHILKPVEV YRGKVIFYSL GNFALEINER WWQSYSKEWF EKANEFHQER SPHRDLKEEA RNSAIVRLHI VDGRIDRVGI VPVVINEAHE PVPHRADTTD GRAVRAYLAQ ITAEVGIDTT FDVVDNEVLV RV
|
| |
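The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and codon list. The helper names (`clean`, `gc_fraction`, `split_codons`) are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Whether the listed GC content of 68% refers to this gene alone is an assumption that the full sequence would have to confirm.

```python
def clean(seq: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 if empty)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def split_codons(seq: str) -> list[str]:
    """Split the sequence into complete codons, dropping any trailing partial codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return [s[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence from the record above.
sample = "GTGGGTAACA ATCGGGGGAT"

codons = split_codons(sample)
print(codons)               # ['GTG', 'GGT', 'AAC', 'AAT', 'CGG', 'GGG']
print(gc_fraction(sample))  # 0.55 for this 20-base sample
```

The first codon is GTG, which translation table 11 accepts as an alternative start codon read as methionine. This matches the protein sequence beginning with MGNNRG.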