Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1408 |
Symbol | |
ID | 5707685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1628677 |
End bp | 1630311 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270918 |
Product | hypothetical protein |
Protein accession | YP_001536299 |
Protein GI | 159037046 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000194543 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTCGCT TCCTCGCCCT GACCACGCGC GCCCTCAGCC TGCTGGGGGT AGCCGCCGTA GCTGTTGGAT TGGTCCTGGT CATGGTGCCG CCGCCGGCCG CGGCGGCGGG TCAGAAGCGC TGCACCGTCA CCGATGACCG GCTCCGGGAG CTGTCCGGGC TGGTGGCCAC GAAGACCGGC TACATCGTGA TCAATGACGG CAGCGAGGTG GAGAGCCACA AGCGGGTCTT CTTCCTCGAC GACGAGTGCG AGATCGCCGA GGAGCCGGTC CGCTACTCCG TCGGTGGCCC GCTGGACACC GAGGACCTGG CGTTGTCGCC GGACGGGAAG ACCCTCTGGA TCGCGGACAC CGGCGACAAC GTGACCAGCA GCAACCGCCG TGAGCGGGTG GCGGTGTGGA GCATGCCGGT GAGCGGCGAG GAAAGGCCGG TGCTGCACCG CCTCGCCTAC CCGGATGGTC AGCCGCGGGA CGCCGAGGCG CTGCTCATCG ACGATGACGA CGCCCCACTG ATCATCACCA AGGTCGCCTC CGGCAAGGCC GAGATCTTCA CCGTCGACGG CGAACTCCCC AGTGGCGACA CCGCGCCAGG GCGGATGAAG AAGCTCGGCG AGGTCGAGCT ACCGGAGACC GAGACCGAGC ATGCGCTCGG GGTGATCGGG CGTACGGCGA TCACCGGGGC CGCCCGTTCA CCAGACGGAT CTCGGGTGGT GCTGCGCACC TACGCGGACG CATTCGAGTA CGACGTCGCC GACGGCGACA TCGTACGGGC GCTGACCACG GCGGAACCGC GGGTGACTCC ACTCGAGGAT CCGTTTGGTG AGGCGATCGC GTACACCCCG GACGGCACGA CCTTCCTCAC CGTCTCCGAC GGCGGCCAAC TCGACGACGG CGAGCCGATC GACATTCTGG CGTACTCGCC GACGGCGGAC ACGAAGGTGG GAGCGGAGGC GAACGCCGAC GGGTCGGAGG AGTCGTCGGC CTCCTGGTTC GCCAGTCTGT CCCCGCAGGA GGTCGCGTAT CTGATCATCG CTGCCGTGGG CCTGATCGGC GTGGCGTTGG TCGGGGCGGG TGTGGTCGGC GTCGTGCGGG CTCGGCGGCG ACCGGCCGGG GGTACGGGGG CCGGTGGGTC GGGCCCGGGT AGGTCCGGCG CAGGTAGGTC CGGCGCCGAC CACGGTTCTC GCGGCTCCAC CGACCGTCCC CGCGGCTCCG TCTATGGTGG CGCGCCTGCC GCCGGTGCCA ACGGCGGGCG TCGGGATCCA GGTCGGGCGG CGGTTCCGAC CGACACGAGC GCGAACGGCG GTGGCCGTCC GGCGGACGGC GGTGGTCCCG CTGCCGGCGT CTACGGCAGT CGGTCGTCCG ACGGTGGCGG TGGCGGCGCT CACCCGGCGG CGGCCGGGGG CGGTCGGCCT GAGGGCGGGC GTGCGACCGG CGGCGGGCGT GGCGGCGGCG TGTACGGCGG GGGAGCCGGT GGTGGCGTGT ACGGCGGCGG TCGTGCCCAG CCGCCGCGCC GCGACGGCGG CGAACCCCCT CCGCGTGCCG GGCGGCGCCG CGAGAACGGA CCGGCCGAGG GCCGGCCCGG CAGTCACGGC CCGGCGACCG GCGGCCGACG TGGCCACTAC GGCGACCGCT ACTGA
|
Protein sequence | MRRFLALTTR ALSLLGVAAV AVGLVLVMVP PPAAAAGQKR CTVTDDRLRE LSGLVATKTG YIVINDGSEV ESHKRVFFLD DECEIAEEPV RYSVGGPLDT EDLALSPDGK TLWIADTGDN VTSSNRRERV AVWSMPVSGE ERPVLHRLAY PDGQPRDAEA LLIDDDDAPL IITKVASGKA EIFTVDGELP SGDTAPGRMK KLGEVELPET ETEHALGVIG RTAITGAARS PDGSRVVLRT YADAFEYDVA DGDIVRALTT AEPRVTPLED PFGEAIAYTP DGTTFLTVSD GGQLDDGEPI DILAYSPTAD TKVGAEANAD GSEESSASWF ASLSPQEVAY LIIAAVGLIG VALVGAGVVG VVRARRRPAG GTGAGGSGPG RSGAGRSGAD HGSRGSTDRP RGSVYGGAPA AGANGGRRDP GRAAVPTDTS ANGGGRPADG GGPAAGVYGS RSSDGGGGGA HPAAAGGGRP EGGRATGGGR GGGVYGGGAG GGVYGGGRAQ PPRRDGGEPP PRAGRRRENG PAEGRPGSHG PATGGRRGHY GDRY
|
| |