Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1931 |
Symbol | |
ID | 5704778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2227400 |
End bp | 2228407 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271436 |
Product | hypothetical protein |
Protein accession | YP_001536807 |
Protein GI | 159037554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.464515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00506626 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCGA TGGAGTTCAC CCGACACGGC GCCCTGTTGG TGCCGGCCGG AACGCTACCC GCCGAGGCGG GCGAGATTGA CCGCCCGGCC GCACCGGACC CGGTTTCCGC ACCGGTCGCC GCGCCGCAGG TGCCGGCACG GCCAAGCCGG GTCGAGCGGC GCCGGGCACG CCGTGACGCC GACGCGCGGG CCGCCCGGGC GGCGGTGCTG GCCGGGGAGA CGAAGAAGGT TCAGCGTTGG ACCGCTCGCG CGGCTGAGGC CCGGCACCTG CGGCGGCTGC TGCTCGACCC GGATGTGCAA GCGGTGCGGC TGATGCGCCA GCGGGCACGC TGGTCGGCCA TGGCGTGGTC AGCGCTGGTG TTCGCGCTGC TGTTCACGAT GGTCAACGTC CAACGGTTCG CCGCCGGCCA CGCCGACCCG TGGTCCCCGA TGTGGATCGT GGCGTGGCTG GTCGATCCGG CGTTCTCGAT CCTGCTGGTC GGTCTGCTCA TCGCCCGAGG ACACCTGTCG GCCGTCCGGC GACGGGTCAC CGAACCCATC GTCAAGCACG TCGAGTACGG CCTACTGGCC GCCACAGCAG CGATGAACGT GGCCCCAGAG TTGACCCAAC GCTTCCCCGG CGGAATCGCC GAACAGGTCG CCTCGGTCGT GCTGCACCTG CTGGTGCCGC TGCTGGCGTT CGCCGCCGCG AGCGTGATCA CGCTGATTCA AGACCACTTC GCCGCCGCAA TCGCCGCCCT CACCAACCCG CCCGACGGTG GCCTTCCGAC CGGCATCAAC CTGCACGAAC ACCCCGATAC CACCGCCGCG ATGGTCGAGC TTTCCGCCGA CGACCGGAAA CTTCTGAACG CCGTCCGGCA CGCCATCACC AGCGGGGATC TCAACACCGA CCCGAACGGC TACGCCATCT ACCGCCGCGT GATGGGCGGC CGAGGTGACA AGACCCGCGC CTACCGCATC GCCACCGCCG TCAACGGCTG GCGACCCGAC CTGCACGCCG TCGCCTAA
|
Protein sequence | MTAMEFTRHG ALLVPAGTLP AEAGEIDRPA APDPVSAPVA APQVPARPSR VERRRARRDA DARAARAAVL AGETKKVQRW TARAAEARHL RRLLLDPDVQ AVRLMRQRAR WSAMAWSALV FALLFTMVNV QRFAAGHADP WSPMWIVAWL VDPAFSILLV GLLIARGHLS AVRRRVTEPI VKHVEYGLLA ATAAMNVAPE LTQRFPGGIA EQVASVVLHL LVPLLAFAAA SVITLIQDHF AAAIAALTNP PDGGLPTGIN LHEHPDTTAA MVELSADDRK LLNAVRHAIT SGDLNTDPNG YAIYRRVMGG RGDKTRAYRI ATAVNGWRPD LHAVA
|
| |