Gene Information |
Locus tag | Sare_4908 |
Symbol | |
ID | 5707424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5575820 |
End bp | 5577028 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641274303 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001539648 |
Protein GI | 159040395 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00382252 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGAGC AGCAGCCAAC CCCCGGCCAA CGCGTCGAAC GGCTCCGTCG GGCAGCCGGC CTATCCCGCG AACGCCTCGC GGGACTCGCG GGACTCAGCG CGACCACCGT GAAGTTCATT GAGAACGGCC GGCGATCATT GACCTTGAGG GCGGCGCAGC AACTCGCCCC ACACCTCGGC GTGCGTGATC TCGGCGATCT ATTCGGCCCT CAGGTTCCCT TGTCTTTGGA TGCCCGACCC AGTCACCCCG CCGTTGACGA CGTTCGCAGA GCCCTCACTG CCTGGCAGGT CACCATTAGT GGTGAGCCCG AATCCACTGA CTACCTTCGT GGTGCGGTCG ACTCCGCCTG GCAGACGTGG CATACCAGCC GCCACCAACG CACCGAGGCC GGTCACCTAC TACCCGGGCT GATAGAGGCA ACTCAACGCG CCACCCGGCT GCACCACGGG GAGGAACGAC GCGCCTCACT GGCGCTGCTC GCCCAGGCGT ACCACCTTGC CCAGGCGTTC CTAGCCTGGC ACGGTGACCG TGAGTTGTGC TGGCTCGCCG TGGACCGGGG CATGACCGCC GCCCTGGACG CCGACGACCC ACTAGCCATC GCACAGTCGA TCTGGTATGC CGCTCACATA CTCCGCGCTG CAGGACGAGG AAGTGATGCC CTGGAGCGGC TGGGCGAGGC GCGATCGCTG ATCGAGCCAC ATGTGACTGA CGGTGGCGTC GAGTGGGCCG AGATGCTCGC CGACCTGCAC CTGTGTATCG CATTGACGAA GGCGCGGATG GGAGATCACG GAGCTTGGTC TGATTGGGAC ACCGCCCGCA CCGTCGTCGA CCGGGCGCTA CCCGCCGGGT TTGTCGGCCT ACGCACCCGG GTATCCCGCC CGTTGGTCGA CGTGTACGCG GTGATGTGCG CTGTGGACCT GGGTGACCCG GACGAGGCAC GGCGTCGCGC CCACGCCCTG GACCCGGCCT CTATCCCGTC GACCGAACGT CGCGGACGCC ACTATGTGGA GCTGGCGCGA TCGGCTGACC TGGAAGGGGC ACGCGAGGCG ACCCTACATT TGCTGACCAG GGCTGAGGCC ACCAGCCCGG AAACCGTGCG GTACTCGCCG GCAGCACAGG ACATGCTGGC ACGGCTCGCG CGTGAGGCCC CAGCGTCAGT GCGGGCGGAA GCAGTGGACC TAGCCCACCG ATTGGGAGTA GCAACCTAG
|
Protein sequence | MVEQQPTPGQ RVERLRRAAG LSRERLAGLA GLSATTVKFI ENGRRSLTLR AAQQLAPHLG VRDLGDLFGP QVPLSLDARP SHPAVDDVRR ALTAWQVTIS GEPESTDYLR GAVDSAWQTW HTSRHQRTEA GHLLPGLIEA TQRATRLHHG EERRASLALL AQAYHLAQAF LAWHGDRELC WLAVDRGMTA ALDADDPLAI AQSIWYAAHI LRAAGRGSDA LERLGEARSL IEPHVTDGGV EWAEMLADLH LCIALTKARM GDHGAWSDWD TARTVVDRAL PAGFVGLRTR VSRPLVDVYA VMCAVDLGDP DEARRRAHAL DPASIPSTER RGRHYVELAR SADLEGAREA TLHLLTRAEA TSPETVRYSP AAQDMLARLA REAPASVRAE AVDLAHRLGV AT
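
The coordinate and length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the terminal stop codon (TAG here) is counted in the gene length but not in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the gene ends with one stop codon that
    is counted in the gene length but is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(5575820, 5577028)
print(gene_len, protein_len)  # 1209 402
```

The result matches the Gene Length (1209 bp) and Protein Length (402 aa) fields above.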