Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1457 |
Symbol | |
ID | 5704168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1685105 |
End bp | 1686940 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270966 |
Product | hypothetical protein |
Protein accession | YP_001536347 |
Protein GI | 159037094 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.425518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00949223 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGACT CCGTGCCGCA GTCCACCGCT CCCGTGTCCA CTGTGGTGGA GCGGCCGACA CGATGGTCCC GGTGGCGGCC CGCCCGCGCC GACGTGTTGG CCATCGCCGT CTACCTGGCC CTCGGCGTAC TGGTCTGCCT CAACCACTGG GTGGACGTAC CCAACCGGGT GTCCGCCCAC CTGCCGACCG ACCACAGCTG GTTCGAGTGG CTCTTCTCCC ACGGGGCGTA CTCGGTGTGG CATCTGGAGA ACCCGCTCTT CACCACCCGG CAGAACGCCC CGGTCGGGGT GAACATGATG GCGAACACGT CACTACTCGG GGTGACCCTG CCACTGGCGC CGCTGACCAT GCTGGTCGGC CCACAGGTGA CGTACGTGTT GTACCTCGGC GCCGCGCTGG CCGCCACTGC CGGCACGTCC TACTGGATGC TCTCCCGGTA CCTGGTCCGT TCCCGGGGCG CGGCGTTCGT CGGCGGCGCG TTCCTCGGCT TCGCACCCGG CATCGTGCAC CACGCCAACG GCCAGCCCAA CTTCGCCTCC AACTTCCTGC TCCCGCTGAT CGTGGCGCGG GTACTGCGGC TGGGCGAGCC GGGCCGCTGG CGACGCAACG GCATCGTGCT CGGCCTGCTG GTGGCGTACC AGATCTTCAT CAACGAGGAG ATGCTGCTGC TCACGGCGCT GGCCTGCCTC GTCGTCGTCG GCACGTACGC GGTGCTGCGG CCGCGGGCCG CCGGAGGGCA GGCCGGCACC TTCGCCGCCG CTCTCGGCAC CGGCGGCGCC CTGGCCCTGT TGCTCACCGT GTACCCGATC TGGTTCCAGT TCAACGGCCC GCAGTCCTAC CGCGGCCTGC AGGGGGGCAC CTTCCACAGT TGGGGCGAGG ACCTGAAGGC GTTCGTCACG TTTCCCCGGG ACTCCCTGGC CGGGGACGAG GCGGTGGAGC GGACCATCGG CGTGACCGAG CAGAACACGT GGTTCGGCTG GCCGCTGGTG CTGCTGGCGG TGGTCGCTCT CCTGCTGCTG GTCGGCCGTT CCCTGGTCGC ACGCATCCTG GCGGTGCTGA CGGTGGTGTT CACCGTGGCC TCACTCGGCC CGGTGATCCG CTTCGACGGC GTCGAGACCG ACACCGACGG CCCGTGGGCG TACGTGCCGG AGGAGCTGCC GCTGGTCGAG ATGATGATGC CGACCCGGCT GAGTCTGATC GTGGCCGCCG CGGTCGGGGT TCTGCTCGCC GTCGCGTGGG ACACCCTGTC CGGATCCGGC CGGCCACCGG TACCCGCGCA GCGTGCCGCC CACCCACCCC GGCTCCAACG GCGGTGGCAG CGTCCGGTCG GCTACGCGGC CATCACGCTG GCCCTGCTAC CGCTGATCCC CCGACCGCTG CCGGTGAAGC CGGTGGAGCC GCCCCCGCAC TTCGTCACCG CCGGTGGCTG GCGGCCGTAC GTGCCCGAGG GGCGCACCCT GGTCCCGGTG CCGATCCCGA GTAACGTGCA CGGCCTGTCC ACGCTGCGGT GGAGTGCGCT GACCACGCAC GCGTTCCCCA TCCCCGGCGG GTACTTCATC GGCCCGGACG AGCAGGGTGC GGGGATCTTC GGGGCGGCGA ACCGGCCGAC GACCCGGCTG ATTTACACCA CGATGGACCG GAACGCCACG CCGAACGTCA CCGACGTCGA GCGCCGGCAG GCGGTCGAGG ACCTGCGGTA CTGGCGGGCC TCGGTGGTGG TGCTCGGCGC CCACCCCCGG GAGGCGGTGC TGCGCGACCT GATGACGTCG CTGCTCGGTG CGCCGCAGCG GGTGGACGAC GTCTGGCTCT GGGACGTCCG TTCCCTGGTC GGTTAG
|
Protein sequence | MVDSVPQSTA PVSTVVERPT RWSRWRPARA DVLAIAVYLA LGVLVCLNHW VDVPNRVSAH LPTDHSWFEW LFSHGAYSVW HLENPLFTTR QNAPVGVNMM ANTSLLGVTL PLAPLTMLVG PQVTYVLYLG AALAATAGTS YWMLSRYLVR SRGAAFVGGA FLGFAPGIVH HANGQPNFAS NFLLPLIVAR VLRLGEPGRW RRNGIVLGLL VAYQIFINEE MLLLTALACL VVVGTYAVLR PRAAGGQAGT FAAALGTGGA LALLLTVYPI WFQFNGPQSY RGLQGGTFHS WGEDLKAFVT FPRDSLAGDE AVERTIGVTE QNTWFGWPLV LLAVVALLLL VGRSLVARIL AVLTVVFTVA SLGPVIRFDG VETDTDGPWA YVPEELPLVE MMMPTRLSLI VAAAVGVLLA VAWDTLSGSG RPPVPAQRAA HPPRLQRRWQ RPVGYAAITL ALLPLIPRPL PVKPVEPPPH FVTAGGWRPY VPEGRTLVPV PIPSNVHGLS TLRWSALTTH AFPIPGGYFI GPDEQGAGIF GAANRPTTRL IYTTMDRNAT PNVTDVERRQ AVEDLRYWRA SVVVLGAHPR EAVLRDLMTS LLGAPQRVDD VWLWDVRSLV G
|
| |