Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4719 |
Symbol | |
ID | 5706021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5342098 |
End bp | 5343039 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641274117 |
Product | hypothetical protein |
Protein accession | YP_001539463 |
Protein GI | 159040210 |
COG category | [S] Function unknown |
COG ID | [COG5495] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000278675 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCTT CGCTGCGTCC GCGTCCCGCC GAACGGCCGG TGGACGCGCC TACCTCTCGA TTCCTGACCG TCGGCGTGAT CGGTGCCGGC CGGGTCGGCG CCGTCCTGGC CGCTGCCCTC GCCGCCGCTG GGCACCGGGT GGTCGCCGCC GCCGGGGGCT CCGACGCGTC GCGCACCCGG ATGGCGCTGC TGCTGGCCGA CGTGCCGCGC CGGTCGGCCA CCTCGGTGGC GCGGACCGCC GCAGACCTGC TGTTGGTGGC GGTGCCGGAT GACGCGCTCG CGGGAGTGGT CGCCGGCCTT GCCGACAGTG GGGCGCTGCG TCCGGGCCAG GTGGTCGCGC ACACCTCCGG CGCGCACGGC CTGGCAGTGC TGACACCGGT CACCGAGGCC GGGGCCGGCC CACTGGCCCT GCACCCGGCG ATGACCTTCA CCGGTACGTC AGACGACCTC ACTCGGCTGC CCGGCATCTC GTACGGGCTG ACCGCCCCGG CGCAGCTGCG TCCCTTCGCC ACCCGTCTCG TCGCCGACCT CGGCGGCGTA CCGGAGTGGA TCAGCGAGGC GGACCGGCCG CTCTACCATG CGGCCCTCGT GCACGGTGCG AACCACCTGG TCACGCTCGT CAACGAGGCG GGTGATCGGC TGCGCGACGC CGGGGTGACC AGGCCGGGGC GGGTGCTCGC CCCGCTGCTG CGGGCCGCCC TGGACAACGC GCTGCGGCTC GGTGACGACG CGCTGACCGG GCCAATCTCC CGGGGCGACG CCGGCACCGT CGAGCGGCAC CTGGCTCGGC TGGCGGCAAC CGCGCCGGAA TCGATGGGCC CCTACCTGGC GTTGGCTCGA CGGACGGCGG ATCGAGCGAT CGCAGCCGGC AGGCTACGTC CGACAGACGC GGAGGCGCTG CTGGGCGTGT TGAGCCGGCG GGGAGGGGAG GCGGCCGGGT GA
|
Protein sequence | MSASLRPRPA ERPVDAPTSR FLTVGVIGAG RVGAVLAAAL AAAGHRVVAA AGGSDASRTR MALLLADVPR RSATSVARTA ADLLLVAVPD DALAGVVAGL ADSGALRPGQ VVAHTSGAHG LAVLTPVTEA GAGPLALHPA MTFTGTSDDL TRLPGISYGL TAPAQLRPFA TRLVADLGGV PEWISEADRP LYHAALVHGA NHLVTLVNEA GDRLRDAGVT RPGRVLAPLL RAALDNALRL GDDALTGPIS RGDAGTVERH LARLAATAPE SMGPYLALAR RTADRAIAAG RLRPTDAEAL LGVLSRRGGE AAG
|
| |