Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4200 |
Symbol | |
ID | 5704200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4767459 |
End bp | 4768727 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641273619 |
Product | hypothetical protein |
Protein accession | YP_001538972 |
Protein GI | 159039719 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0938646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00891245 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCATCCG TCACCCCCGA CCAGCCCGGC CGTCGCGCCC GGACGGGCGC ACGTCCGCCT GGGCGACCTG GGCCGGCTCG CCGGGTACCC CCGCCACGGG CGGGCGGGCC GCCGAGCCGT CCGCCGCTGG CCGTCGCCGC AGGGGCCGCC GCTGTCGGAG CCGCACTCAC CTCGTGGCTG CCGGTAGCGG TGGTGCTCTG GTTCTTCCAG CTCAGTGAGA GTGCGGCGAC ACTGTTGGGA GTGGTCCGGA TCGGGCTGGC CGGTTGGCTG CTCGGACACG GCGTACCCCT GGTGACGGAC GTCGGTCCAC TCGGGCTCGC TCCGCTGGCC GTGACCCTGC TCGCCGGCTG GCGCCTCACC CGTGCCGGGG TGCACGTCAC CCGAGCTATC GGCGCGCGTG GCAGCCGTTC GCCGAAGCGC GCGGTCCTCG CCGCGGGCGC GGTCGGGTTC GCGTACGCCG TCCTTGGAGT GTCCGCTGCC CTGCTGGTCA CCACCGGGGA ACCGGCCGTG TCGCCGGTGC GGGCCGGCCT GACCCTCGCC GTGGTGGGAA CGGTCGCCGC CCTGGCCGGC GCAGTTCGTA CGACCGGACT CGGTGACCAG TTCGCCGAGC GGGCGCCACT ACCGCTGCGG GAGGGGATCC GCACCGGCCT GGTCGCGGCC CTGTTGCTGC TCGCGGCGGG GGCTGGCATG GCGGGGCTGG CGGTGGCGAC CGGCGGCGGT GATGCGGCCG ACCTGATCGG GAAGTACCAC ACCGGGGTGG CCGGGCAGGC CGGGATCACC CTGGTCAACC TGGCGTATGC CCCGAACGCG GCGGTCTGGT CGACCAGCTA CCTGCTCGGT CCCGGGTTCG CGGTCGGCAC CGACACCACG GTACGGACCA GCGAGGTGAC GGTGGGGGCG TTGCCGGCCC TACCGTTGGT CGCCGGCCTC CCCGGTGGGC CGGCAGACGG CCTCGGCGCG GGTCTGCTTG CGGTGCCGGT CCTGGTCGGG ATGGTGGCGG GCTGGCTGTT GACCCGCCGG GTGTTGCGGC TCGTCGACGA GGGCGCCCGG CGACAGTGGG GGCCGCTCCT GCGGCCGGCG GCACTCGCCG GCCCGGTAGC GGGCCTGCTG GTGGGACTCG CGGCGGCGGC GTCGGCCGGT TCGCTGGGTG CTGGCCGGCT GGCCGAGGTA GGGCCGGTGC CGTGGCACGT GGCGGCCGTG GCGACCGCGG TGACCGGGGC GGGTGTGCTG GGTGGCGTGG TCGCGGCCCG TTTCCTGTCC CGTGCCTGA
|
Protein sequence | MPSVTPDQPG RRARTGARPP GRPGPARRVP PPRAGGPPSR PPLAVAAGAA AVGAALTSWL PVAVVLWFFQ LSESAATLLG VVRIGLAGWL LGHGVPLVTD VGPLGLAPLA VTLLAGWRLT RAGVHVTRAI GARGSRSPKR AVLAAGAVGF AYAVLGVSAA LLVTTGEPAV SPVRAGLTLA VVGTVAALAG AVRTTGLGDQ FAERAPLPLR EGIRTGLVAA LLLLAAGAGM AGLAVATGGG DAADLIGKYH TGVAGQAGIT LVNLAYAPNA AVWSTSYLLG PGFAVGTDTT VRTSEVTVGA LPALPLVAGL PGGPADGLGA GLLAVPVLVG MVAGWLLTRR VLRLVDEGAR RQWGPLLRPA ALAGPVAGLL VGLAAAASAG SLGAGRLAEV GPVPWHVAAV ATAVTGAGVL GGVVAARFLS RA
|
| |