Gene Information |
Locus tag | Sare_2121 |
Symbol | |
ID | 5704975 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2443800 |
End bp | 2444750 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271606 |
Product | hypothetical protein |
Protein accession | YP_001536977 |
Protein GI | 159037724 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00827277 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0571528 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTTG CCGCACCCCG GCGCGATCAT CGACTGACCA GTCAGTCGGC GGTGCTGCTC GCTCGGATAC GGCGTGGAGG TCAGATGGGC CAGATGGTGC GGATCCCGGG CGGGATGTTT CTTCAGGGGT CGCCGCCCTG GCTGATCGAC TGGCTCGACC AGGCAGACCA GCCGCTACCC CGGACATGGT TCGCCGACGA GACGCCGCAG GTTTCCCGAC GCCTACCTCC GTATCGGATG GATCGGCACC TGGTGACGGT GGCTGACTTC CGGCGGTTCG TGCGCGCGAC CGGGTACCGA ACCGACGCCG AGCGGCGCGG CTTCGGACTG GTATACGAGG CGCTGGGCTG GGTGGAGCGC GACGGTGTCT GCTGGCACTC GCCAGGTGGC CCGGACACCG GGTCGCCCGG GTACGACGAC CATCCCGTGG TGCACGTGTC CTGGGAGGAC GCCAACACCT ACGCGCAGTG GGCCGGTAAA CGCCTGCCGA CCGAATCCGA GTGGGAGTTC GCCGCCCGTG GGTCCGGCTT CCGGATCTGG CCCTGGGGCG ACACCTGGCA GGTCGACCAC GCCAACACGG CGGAGCTGCA TGCCGGCCCG CTGAACTCGC TCGCCGCGTG GCGCGAATGG TGGCAGGCGA TGTGCGAGAA ACACGGACCG GTGCCACACA CGACGCCCGT CGGTATGTTC TCTCGCCATG GCGACAGCGC TCTCGGGTGC GCCGACATGG CGGGCAACGT GTACGAGTGG ACCTCGACGC TGTCGGAGCT CTACGACGAA GGCGTGGTCT GCGACCCCAC GGTACGAATG GCGATGGGTC GGTACCGGGT GATTCGGGGT GGGTCGTGGA TGAACTTTCG CTACCAGGTT CGCTGTAGCG AGCGGATGCA CGGCGACCCC ACTGGGTGGT CGAGTTTCGC GCACGGCTTC CGCTGCGCGC AGGACGAGTG A
|
Protein sequence | MTFAAPRRDH RLTSQSAVLL ARIRRGGQMG QMVRIPGGMF LQGSPPWLID WLDQADQPLP RTWFADETPQ VSRRLPPYRM DRHLVTVADF RRFVRATGYR TDAERRGFGL VYEALGWVER DGVCWHSPGG PDTGSPGYDD HPVVHVSWED ANTYAQWAGK RLPTESEWEF AARGSGFRIW PWGDTWQVDH ANTAELHAGP LNSLAAWREW WQAMCEKHGP VPHTTPVGMF SRHGDSALGC ADMAGNVYEW TSTLSELYDE GVVCDPTVRM AMGRYRVIRG GSWMNFRYQV RCSERMHGDP TGWSSFAHGF RCAQDE