Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4839 |
Symbol | |
ID | 5707744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5488888 |
End bp | 5490252 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274235 |
Product | hypothetical protein |
Protein accession | YP_001539580 |
Protein GI | 159040327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000509999 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCAGACCC AACGCGACCA CGTCCACGCC CACACCTTCA TGATGGGCCG GCTGAGCTCG GCCCTGGTGT TGGGTGACCC GACCGGGGCC GAGATTCCCG GCCGCCGCGC GCAGACCGGC CTGCTGATCG GGATCATCCT GGCGCTGCTG GTGTCCGGCG GCTTCGCCGT GTACGGGTGG ATAGTCCCCG GAGGAAGCAC CGCCTATCGG CAGGCCGGGG CGATCCTGGT GGAGAAGGAG ACCGGCAACC GCTACGTCTA CCTCAACGGG CTGCTGCACC CCACCCCGGA CCTGACCTCG GCGATGCTGA TCCAGGGCCC CTCCGCCGAG GTCACGTTGA TTTCGAAGAA CTCCCTCCGG GATGTGGCAC GTGGTGCCCC GCTCGGGTTG GCCGGCGCGC CCAGGCAGCT ACCGTCGACC GACGGGTTCG TACGGGGGCC GTGGCTCGCC TGCCTGCCCG GCTCTGTCGC CCCCGGTCGA TCCGTGTCCG GGCTGGGCAT CAACCTCGAT CCCGGGGTAG CCGCCGACCC GTTGCCGGCG GACCGGTTCG TCGTCGTACG TGACCAGCGT GACGTGGCCT ATCTGCTCGC CAACGGGGTG AAGTATCGGG TCGACGACGA GGCGGTGCTG GTGGTGTTGG GTGCCGCCAC CGTCAGCCCG GCCCCGGCCC CGCAGTTGTG GCTGGATTGG CTGGACGACG GGCCCGCCCT GGCGCCCGCC CGGATCGAGG GCGCCGGTGC TCCCGGTCTG CAGGTGGGTG GGCGTGCCCA CCCCGTCGGG ACGCTCTTTC GTCAGCGGGT GGAGTCCGGC TCCGAGCAGT TCTTCGTGCT GCGCCGGGAT GGGCTCGCAC CGATGAGCCG GACGGAGTTC CTGCTGGCCG ACGCCAAGGA CGAGGACGCT GCGGTCGAGC TGAACCCGGC GGCGATCGTC GACGCTCGGC GCTCCGCCGA CCGCTCGCTG CTGGACCGGT TGCCCGACCT CACGCCGCTG CGGCTGCTGG ACACCGCCGG ACGTGCCCTG TGTGCGCGGC AACGCCCGGT CTCGGCCGAG GAGTACGCCA GCGAGGTGGT GCTGGTACCG CAGCCGGCAG CCGCCATGAG CGCGGACGGC ACGCCGCTCG TGCTGACCCG TCCCGGGGCC GGGATGCACG TGGTCGCCGC CCCCGTGCCG GCGCAGACCG CCACCGCACA CACCTTCGTC ATCTCCGACG ACGGCATCGC CTACCGTCTC GCGGACCAGG CCACGAGGTC CGCGTTGAAG CTGGGCACGG TCGCGCCCAT ACCGTTTCCG AAGGACCTGT TGGCGGCAAT GCCGCAGGGA GCCGTGCTGA GTCGTGAGGC TGTCACAAGC CTGCCGAGGG GGTAG
|
Protein sequence | MQTQRDHVHA HTFMMGRLSS ALVLGDPTGA EIPGRRAQTG LLIGIILALL VSGGFAVYGW IVPGGSTAYR QAGAILVEKE TGNRYVYLNG LLHPTPDLTS AMLIQGPSAE VTLISKNSLR DVARGAPLGL AGAPRQLPST DGFVRGPWLA CLPGSVAPGR SVSGLGINLD PGVAADPLPA DRFVVVRDQR DVAYLLANGV KYRVDDEAVL VVLGAATVSP APAPQLWLDW LDDGPALAPA RIEGAGAPGL QVGGRAHPVG TLFRQRVESG SEQFFVLRRD GLAPMSRTEF LLADAKDEDA AVELNPAAIV DARRSADRSL LDRLPDLTPL RLLDTAGRAL CARQRPVSAE EYASEVVLVP QPAAAMSADG TPLVLTRPGA GMHVVAAPVP AQTATAHTFV ISDDGIAYRL ADQATRSALK LGTVAPIPFP KDLLAAMPQG AVLSREAVTS LPRG
|
| |