Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3984 |
Symbol | |
ID | 5706659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4526575 |
End bp | 4528803 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273409 |
Product | hypothetical protein |
Protein accession | YP_001538765 |
Protein GI | 159039512 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.580722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00476383 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAGA AGCTGCTCGG GAAGGTCGTG GCGGGTGCGG CCCTCGGTGG CGCCTCCCTG CTCGTGTGTA CGCCGGGGAT TGCGGTCGCA GACGATCATC ACCGTAAGGA TGAAGGCAAG GTCTTTGCTA AGCCACACGC GGTCGAGGCC GGCCACGAGG TCAAGCTGCT CGAGATCTGC CCCAAGCCGC AGGAGCACGC CTACGTCTGG TCGAAGGTGA CCGGCAAGGT CGATCTCAAG CCGGCTCACG ACCGGCGGGA TGGCAAGGGC TACCGAGACA TGGGCGGTCG CGACTACAAG CGCGACCACA AGGGCGAGGA CTACAAGCGC GACCACAAGG GCGGCCGGGA CGGCAGCGAC GCCGAAGGCC GCGAACACCA CGACTACAAG CGCGACCACA AGGGCGGCTA TGGCGACAGC CAAGCCGAAG GCCGCGGCTG GGGCGACGAA AGCCACAGCG GCTACGGCCA AGCCGAAGGC CGCGGCTGGG GCGACGAAAG CCACAGCGGC TACGGCCAAG CCGAAGGCCG CGGCTGGGGC GACGAAAGCC ACAGCGGCTA CGGCCAAGCC GAAGGCCGCG GCTGGGGCGA CGAAAGCCAC AGCGGCTACG GCCAAGCCGA AGGCCGCGGC TGGGGCGACG AAAGCCACAG CGGCTACGGC CAAGCCGAAG GCCGCGGCTG GGGCGACGAA AGCCACAGCG GCTACGGCCA AGCCGAAGGC CGCGGCTGGG GCGACGAAAG CCACAGCGGC TACGGCCAAG CCGAAGGCCG CGGCTGGGGC GACGAAAGCC ACAGCGGCTA CGGCCAAGCC GAAGGCCGCG GCTGGGGCGA CGAAAGCCAC AGCGGCTACG GCCAAGCCGA AGGCCGCGGC TGGGGCGACG AAAGCCACAG CGGCTACGGC CAAGCCGAAG GCCGCGGCTG GGGCGACGAA AGCCACAGCG GCTACGGCCA AGCCGAAGGC CGCGGCTGGG GCGACGACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AAGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AAGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG ACCGGGGCGC CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG ACCGGGGCGC CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACT GGGGCGCCGC CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACC GGGGCGCCGA CGACGCCGAA GGCCGCGAAC ACCACGACTA CAAGCGCGAC CACAAGGGCG AGGACTACAA GCGCGACCAC AAGGGCGACC GGGGCGCCGA CGACGCCGAA GGCCGCGGTG GAGACCGCTG GGAGCACAAG CGGGACTACA TCTACTACGG TGAGGCTACG GTCGACAGGA ACGCCAAGCC GGGCACCTAC AAGCTCGAGG GCTCGTGCGG CGAGGGTGAA CTCGTCGTCC TGCCCCGCGG CGGGGTCGAC GGTGGTGACG GCGGCATGAG CGCTGGCACC AACCGGGGGC TAGCCGCCGG TGGGGCTGGT CTGCTCGGTG CAGCTGCTCT GGGCGGCCTG GTGCTGCTCC GTCGTCGAAC CGATGGTTCC CTGGTCTGA
|
Protein sequence | MAKKLLGKVV AGAALGGASL LVCTPGIAVA DDHHRKDEGK VFAKPHAVEA GHEVKLLEIC PKPQEHAYVW SKVTGKVDLK PAHDRRDGKG YRDMGGRDYK RDHKGEDYKR DHKGGRDGSD AEGREHHDYK RDHKGGYGDS QAEGRGWGDE SHSGYGQAEG RGWGDESHSG YGQAEGRGWG DESHSGYGQA EGRGWGDESH SGYGQAEGRG WGDESHSGYG QAEGRGWGDE SHSGYGQAEG RGWGDESHSG YGQAEGRGWG DESHSGYGQA EGRGWGDESH SGYGQAEGRG WGDESHSGYG QAEGRGWGDE SHSGYGQAEG RGWGDDKRDH KGDWGADDAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGDRGADAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGDRGADAE GREHHDYKRD HKGEDYKRDH KGDWGADDAE GREHHDYKRD HKGEDYKRDH KGDWGAADAE GREHHDYKRD HKGEDYKRDH KGDRGADDAE GREHHDYKRD HKGEDYKRDH KGDRGADDAE GRGGDRWEHK RDYIYYGEAT VDRNAKPGTY KLEGSCGEGE LVVLPRGGVD GGDGGMSAGT NRGLAAGGAG LLGAAALGGL VLLRRRTDGS LV
|
| |