Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3938 |
Symbol | |
ID | 5703675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4481329 |
End bp | 4482522 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641273363 |
Product | hypothetical protein |
Protein accession | YP_001538719 |
Protein GI | 159039466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.789456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.200985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCTG ACCTGTACCC GGATCAGACG GCGTTGCGGG TGCCGCAGGC GGGTCCGTCC GACGCGGTCT CGGCGGTGGC GCGGGCGTCG GATCGGGGGG CACGGATCGG CGTCGTGGTC ATCGCGCTGT CCTGGCACTG CACCATCGGG CTACCCGCAA CGGAGAGGAT CCGGGGCGAG TTGACCGCGC CCAGTGTGGT GCTCGGTACC TGGCTCCTGG TCACCGTCAC CGGCGTGGTC ACCGGTGTCC GACTGCTACG CGGTAGGCCG CTACCGGCCT GGCCACTGGC CGCGCTGCTG CTCGTCGTGG ACGTGACGGT GTTCGCCGCG GTCGGCCGGG AGCACATGTT CAGCAGCGTC AACTGGGTGC GGGGGACGCT GGGCTGGTTC TTCGTCCTGA TCCTGTGGGA ACGACGGATG ACCGCGCTGC TGGGCATGCT CACTGCGCAC GCCCTGATCG CACTGGCCGC GCTGCTCGCC TACGGGATGA CCACTGCCGC GGACCTCGCC CGCTACACGA TGCACGTCTA CGGAGTCTCG TCGCTGCCGG TGGCGGTCGC CGCCGGCAGC GCCGCGCTCG CCACTCTGGC CCGAAAGCGC GCCCAGGTGG CCGCCACGGC GCATGCGCTG GCAGCTGAAC GGGAGGCCGC CGAGCAGGTC CGACAGGAGC GGCGGGACCG GTTGGGCCGG GCGGGCGAAG CCGCCCGCGA GGTGCTGGCC GAGCTGGCCG ACGGCCGGGC CGACCCGGCC GATCCGGCCG TACAGCGCCG ATGCGTCCTG GCGGCTGCCC GGCTGCGCCG ACTCATCGCC GAATCAGACG ACGTACCCGA CCCGCTGCTG CACGAGTTGC GGGCCGCAGC CGACTTGGCC GAACGGAACG GGCTAGCGAT CAGCCTGGTG ACCATCGGTA CCCCACCACC GCTGCCGGTG CGGATCCGCC GCCGGCTGGC CGACCCGCTG ACCGCCGCGC TCGCCGAGGC GCGGGACTGG GCTCGGCTGA CCGTGGTGGC CGGCCCGGAC GAGGTGGCCG TCAGCCTGGT CACCCCGGAT CGCCGGGAGG ACCCCATTCG GTCTGGCGAC GACAACGGGG ACAGCGACGA AGGGGACAGC GACAGCGAAG GAGACGGTGG GGTGCAGCAC CTCGACGAAC GGGACGGAAA GATCAGATGG ACGCAGACCC GGTGGCGGCG GTGA
|
Protein sequence | MSADLYPDQT ALRVPQAGPS DAVSAVARAS DRGARIGVVV IALSWHCTIG LPATERIRGE LTAPSVVLGT WLLVTVTGVV TGVRLLRGRP LPAWPLAALL LVVDVTVFAA VGREHMFSSV NWVRGTLGWF FVLILWERRM TALLGMLTAH ALIALAALLA YGMTTAADLA RYTMHVYGVS SLPVAVAAGS AALATLARKR AQVAATAHAL AAEREAAEQV RQERRDRLGR AGEAAREVLA ELADGRADPA DPAVQRRCVL AAARLRRLIA ESDDVPDPLL HELRAAADLA ERNGLAISLV TIGTPPPLPV RIRRRLADPL TAALAEARDW ARLTVVAGPD EVAVSLVTPD RREDPIRSGD DNGDSDEGDS DSEGDGGVQH LDERDGKIRW TQTRWRR
|
| |