Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5041 |
Symbol | |
ID | 5707312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5708162 |
End bp | 5710039 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274434 |
Product | hypothetical protein |
Protein accession | YP_001539775 |
Protein GI | 159040522 |
COG category | [R] General function prediction only |
COG ID | [COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.025652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATCGC GCCGTCCCAC CGTCGTCGCG GGCACGCTGC TGGCGCTCCT GCTGCTCGCG GGCAGCGCCG CCGCGACGCG TCCACCGGCA CCGAGCAGGC CGCCCGCCGG GCCGGGGTCA CCGATCCAGT TGGTCTCCTT CACCTCCTGC GCCGACGCGC TGGCCGAACT CCGCGCCGTG ACCACCGCCG CCGTCAATCC GCGAGGGCTC CCCGGCGAGG CCCTCCCCCT TTCGTCGGGC CCCACCGACG ACGTCGCCAA GTCGGCGCCA GCCAGCGAGC ACTCGGTCAC CAACAGCTAC GAGCCCGGCG TCGACGAACC GGATCTCGTC AAGACCGACG GGCAGCGAAT CGTCATGCTC AGCCAACAGG GTGTGCTTCA TGTCGTTGAC CCCGTCACCA ACCGGTTCAC CGGGAGGCTG AACATCAGCC GTGCCTCTTA CTGGGGTCGG TACGACCTGC TCCTGCACGG CGATCACGCC CTGATCCTCA CCGACGCGGA ACTTATGGTC CGCCCAGCGG TCGACGCCGG TGGCGATGAA ACTGCCAGCG AGCCGGCCGG AATGAGCTTT CACGCCGAGC CCTCCACGAC CCGACTCCTC CTGGTCGACC TGAGCGGCCC ACCCCGGGTG CTGGGCACAT ACAAGATCCG AGGCCGCACC GTTGACGCCC GGCAGACCGG GAGCACCGTC CGGGTGGTGG TCCGGTCTCA CGCTCAGGTG CCCTTCCCGG AGCTGCCCGC CACCGCCGAC GAGGCGGCCC GCGAGGCGGC CAACCGGGCC GCGGTGGCTA CCGCGGGCAT CGAGGCGTGG CTGCCAACCT ACGAGTGGAC GGCCGGAACG CAAAAGGGGA GCGGTCGAGT CGACTGCGAC CGGCTCAGCC GCCCGCAAAC CGGCACGGGC TCCACCATGC TGACCGTACT CAGTTTCGAC CTCACCGCCG ACCGGCTCAC CGACGGAAAC CCCGTCAGCG TGGCCGCCGA CGCGGACACC GTCTACAGCA CGGGCGGCAG CCTCTACCTG GTGGGCCAGC GATGGGTGGA GGTGCCGCCG GCCCCGGACC GACGGCCCGG CCAGATCGGC GAGGCGATCA CCGACATCTA CCAGTTCGAC ACCGCCGCTG CCGGCCGTCC CCGGTACGTC GCCGCCGGCA CGATTCCCGG CCGCCTGATC AACCAGTACG CGCTGTCGGA GTGGCAGGGC CACCTACGCG TCGCCACCAC CACAGGACAG GACGAACGCA CCTCGGAATC CGGCGTACAC GTGTTGCGCC GGCAGGGCGA CACGCTGACC CCGACGGGCG CGGTCACCGG CCTGGGCCCG GGGGAATGGA TCCGGTCGGT GCGCTATCTC GGCGACACCG CCTACGTGGT GACGTTCCGG CAGACCGACC CGCTCTACGC GCTCGACCTG AGCGACCACA CCGCACCCCG AGTCACGGGC GAGTTGAAGA TCACCGGCTA CTCGGCGTAC CTGCACCCGA TCGCGGACGG CCGGCTGCTC GGCATCGGGC AGGAGGCCGA CCTCGACGGA CGCGTACAAG GTGTCCAGGT CTCACTCTTC GACGTCCGGG ATCCGGCCCG ACCGCTCCGG TTGGATCACT GGCACCGCCC GAACGCCTGG TCCGTGGCCG AGCACGACCC GCACGCCTTC CGATACGACC CGAAAACCGG GCTACTCGCC GTTCCGGTCG ACGCCGGCCT GCGCCTGCTG CGGGTCTCCG GGGACACCCT CACCGACCGG GGCGAGGTGA CTCACCCGGA GGGGGTCATC AGCCGGTCGT TGCTCGTCGG TGACACGCTG TGGACGGTGT CGGACGTGGG CCTGCGGGCC AGCGACCCAA CGACCGGACG GAGCCTGGCC TGGCTCCCCA CCACCTGA
|
Protein sequence | MRSRRPTVVA GTLLALLLLA GSAAATRPPA PSRPPAGPGS PIQLVSFTSC ADALAELRAV TTAAVNPRGL PGEALPLSSG PTDDVAKSAP ASEHSVTNSY EPGVDEPDLV KTDGQRIVML SQQGVLHVVD PVTNRFTGRL NISRASYWGR YDLLLHGDHA LILTDAELMV RPAVDAGGDE TASEPAGMSF HAEPSTTRLL LVDLSGPPRV LGTYKIRGRT VDARQTGSTV RVVVRSHAQV PFPELPATAD EAAREAANRA AVATAGIEAW LPTYEWTAGT QKGSGRVDCD RLSRPQTGTG STMLTVLSFD LTADRLTDGN PVSVAADADT VYSTGGSLYL VGQRWVEVPP APDRRPGQIG EAITDIYQFD TAAAGRPRYV AAGTIPGRLI NQYALSEWQG HLRVATTTGQ DERTSESGVH VLRRQGDTLT PTGAVTGLGP GEWIRSVRYL GDTAYVVTFR QTDPLYALDL SDHTAPRVTG ELKITGYSAY LHPIADGRLL GIGQEADLDG RVQGVQVSLF DVRDPARPLR LDHWHRPNAW SVAEHDPHAF RYDPKTGLLA VPVDAGLRLL RVSGDTLTDR GEVTHPEGVI SRSLLVGDTL WTVSDVGLRA SDPTTGRSLA WLPTT
|
| |