Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3018 |
Symbol | |
ID | 5707359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3431337 |
End bp | 3433148 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641272465 |
Product | hypothetical protein |
Protein accession | YP_001537833 |
Protein GI | 159038580 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0249937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTGCC TGGTGGGGGC TGGTCCCAGG GGACTCGCCG TCCTGGAGCG ATTGTGCGCC AATCATGCTG GCGAGAACGA GCTGGTGATC CACGTAGTGG ATCCCTTTCC GCCGGGGTCA GGGAGGATCT GGCGGGAGCA GCAGTCACCA CAACTGCTGA TGAATACGGT GTCGTCCCAG GTCAGCCAGT TCACCGATGA GAGTATCGAC TGCTCGGGTC CGATCAGGCC GGGCCCGAGT TTGCACGGTT GGCTGCAAAG CCACGACTCT GATGCTGACC AGGGGCGACG GGGCCCTGAC GACTATCCCT CTCGGAGATT GTACGGCCGC TATCTGAGAT GGGTGTTTGA TCGCGTGGTA GCCGATGCGC CTGACACCGT CCGCGTCGTT GTCCATCGGG CCACAGCGGT GGCGTTGGAG GACGCGGACA ACGGCCAGTG CGTCACCCTG GATGACGGAA ATCGACTGGC GGGGATGGAC GCCGTGGTTC TGTCACTCGG ACACAGCGAT ACGGAGCTGA CCGGCAAGGA ACGCCAGTTG GCGGGTTTCG CCGAACGTCA CGGCCTGCGT TACTTTCCTC CCGCAAACCC AGCCGATCTC GATTTCGATA AGATCGATGC CGGGGAGGCA GTCGGTGTCC GGGGACTCGG GCTCGCATTC TTTGACGTCC TGGCACTGGT TATGGAGGGT CGAGGCGGTC GGTTCGTGCC TTCCGACAAG GGTCTACGGT ATCGACCGTC CGGTCGCGAA CCAACTCTGT ACGCCGGAGC CCATCACGGG ATTCCCGACT ACGCGAGGGG GAGAAATCAG AAGGGTGTCG CCGGCAAGCA TCGTCCCCGG TTCTTGACTG CCGACGCCGC CCACCGTATT CGCGAAAACC CGGAGGCGAC GTTCCGGCGG GACGTGTGGC CCTTACTGGA TGCCGAAGTT CGTACGGTCT ATTATCAGGC ACTCGTCGCA CAGCGCGGCG GTTCCTCCGC CGCCAGCAAC TTCCTGAAAG ATTACCTTTC TGAACCCGAC GATCCAGGAA CACTTCGACG GCACGGCCTG ACCGCATCGG ACGAATGGAG TTGGGAGAGG CTCGGCCAGC CGTGGCGGCC CCACGAATTC TCTGATCATG CGACGTTCAA TCAATGGCTA CTCGGTCACC TCCGCGAGGA CATCGGCCAT GCTGAGGTCG GTAACGTCGA CGACCCGGTC AAAGCCGCCC TGGACGTCAT ACGAGACCTG CATAAAGAGA TTCGACTGGC AATTGACCGC TCGGGTGTCA TCGGATCGTC CTATCGCGAC GAAGTGATCC ACTGGTTCAC GCCACTGAGT ACTCTTTTTT CTGCTGGTCC GCCGCCCCTG CGGGTCGAAC AGATGGCGGC ACTTATCGAG TGCGGTCTGC TGCAGGTCGT CGGCCCGGAG CCCCAGGTGC GAACGGACCC TACCGGCGCC TGCTTCCTCA TCGGCTCCGC CACGATACCT GGAAAACAAA TACGGACAAC ATCATTGATT GAGGCGCGTA TCCCGAAACC AGACCTGAAG CACAGCGCCA ATCCGCTGCT GTGCTTCCTG GTCGAAACAG GGCAGTGTCG TCCATATCAT ATTCCGGACC CCGACGGTGC CTACGAAAGC GGCGGGCTTG ACGTCACCCC ACGGCCATAT CGCCTGATAG ACGCCGCCGG CGTCCCCCAT TCGCGCCGTT TTGCGTACGG ACCACCGACC GAGTCGGTTT TCTGGTTCCT GAACGAAACG ATTCGTCCCG GCATCGGCTC CATGATTCTC GAGGACGCGG ATGCCATCTC TCGAGCGGCG CTCACGTGCT AG
|
Protein sequence | MICLVGAGPR GLAVLERLCA NHAGENELVI HVVDPFPPGS GRIWREQQSP QLLMNTVSSQ VSQFTDESID CSGPIRPGPS LHGWLQSHDS DADQGRRGPD DYPSRRLYGR YLRWVFDRVV ADAPDTVRVV VHRATAVALE DADNGQCVTL DDGNRLAGMD AVVLSLGHSD TELTGKERQL AGFAERHGLR YFPPANPADL DFDKIDAGEA VGVRGLGLAF FDVLALVMEG RGGRFVPSDK GLRYRPSGRE PTLYAGAHHG IPDYARGRNQ KGVAGKHRPR FLTADAAHRI RENPEATFRR DVWPLLDAEV RTVYYQALVA QRGGSSAASN FLKDYLSEPD DPGTLRRHGL TASDEWSWER LGQPWRPHEF SDHATFNQWL LGHLREDIGH AEVGNVDDPV KAALDVIRDL HKEIRLAIDR SGVIGSSYRD EVIHWFTPLS TLFSAGPPPL RVEQMAALIE CGLLQVVGPE PQVRTDPTGA CFLIGSATIP GKQIRTTSLI EARIPKPDLK HSANPLLCFL VETGQCRPYH IPDPDGAYES GGLDVTPRPY RLIDAAGVPH SRRFAYGPPT ESVFWFLNET IRPGIGSMIL EDADAISRAA LTC
|
| |