Gene Information |
Locus tag | Sare_3194 |
Symbol | |
ID | 5705629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3684477 |
End bp | 3685469 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272625 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001537992 |
Protein GI | 159038739 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.857429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00472645 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGTCGATCA ACGAGCACTA CCGACTGGAC CCGAACGCCC GGGTGCTGCT GTCCGATCTG GGCCTGTCCG TCGGCAACAT CCTGCGCCGG GCCGGGCTGC CGGGAGACAC GCTGTCCGAC GGCCCGGCCA CCCTGACACC AGAGCGGTTC TACGCCCTGT GGGAGGCGGT CGCCGCCGAG GCCGCCGACC CCGGGCTCCC GATCCGGATC GGTCAGGCCA TCTCCGTCGA GGCGTTCCAT CCGCCGCTCT TCGCGGCGCT GTGCAGCCCG AACCTCGGCG TGGCCGCTGC CCGCATCGCC ACGTACAAGG CGCTCATCGG ACCGCTACGA CTCGTCATCG CTACCACCGG TGAAGGGCTC GAGGTGGAAC TGCACTGGCC GCCAGACCAC CGGCCCCCGG AAGTCCTGAC GACGACCGAG CTCGTGTGGT GGGTCGCCCT GGCCCGACTG GCCACCCGGA CGAGGGTGGT GCCGGTCGCC GTCACCAGCG CGCAGCCACC GTCCGCCGCC AGTGCACTCG CCGACTACCT CGGAGTTCGC GTACAGCAGA CCGAGCGTTT CACCGTGACC TTCAGCGCCC GGGACTCGGC CCGCCCGTTC CTCACCGCCA ACGAGCCGAT GTGGGAGTTC TTCGAGCCGG AACTCCGCAG CCGGCTCGCC CACCTGGAGC GCGGCGCCAC GGTACGCCAA CGGGTACAGG CCGCCCTGCT CGAACTACTA CCCAGCGGCC GGGGCACCGT CGACGGCGTG GCCCGTGAAC TGACGATGGG GGCCCGTACG CTGCAACGTC AGCTGAAGAG CGAAGGCACC AACTTCCAGA CCGTACTCAA CGACACCAGA CGATCGATCG CCCACCGCTA TCTCAGCGAG GGGAACCTCT CGGTGGCCGA GATCGCATTC CTCCTCGGCT ACGACGAACC CAGCTCGTTC TACCGTGCGT TTCACGCCTG GACCGGACGC ACCCCACTCG CCGCCCGAGC AGAACTCGGG TGA
Protein sequence | MSINEHYRLD PNARVLLSDL GLSVGNILRR AGLPGDTLSD GPATLTPERF YALWEAVAAE AADPGLPIRI GQAISVEAFH PPLFAALCSP NLGVAAARIA TYKALIGPLR LVIATTGEGL EVELHWPPDH RPPEVLTTTE LVWWVALARL ATRTRVVPVA VTSAQPPSAA SALADYLGVR VQQTERFTVT FSARDSARPF LTANEPMWEF FEPELRSRLA HLERGATVRQ RVQAALLELL PSGRGTVDGV ARELTMGART LQRQLKSEGT NFQTVLNDTR RSIAHRYLSE GNLSVAEIAF LLGYDEPSSF YRAFHAWTGR TPLAARAELG
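The coordinate, length, and sequence fields of this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the helper names (`span_length`, `expected_protein_length`, `translate`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The codon assignments used are those shared by the standard genetic code and translation table 11, which differ only in their permitted start codons.

```python
# Values copied from the record above.
START_BP = 3684477
END_BP = 3685469
GENE_LENGTH_BP = 993
PROTEIN_LENGTH_AA = 330

# First 30 bases of the gene sequence and first 10 residues of the
# protein sequence, with the display spacing removed.
GENE_PREFIX = "ATGTCGATCAACGAGCACTACCGACTGGAC"
PROTEIN_PREFIX = "MSINEHYRLD"

# Codon assignments with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate(dna):
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

For this record all three checks agree: the coordinates span 993 bp, 993 bp corresponds to 331 codons (330 residues plus the terminal TGA stop), and the first ten codons translate to the first ten residues of the listed protein.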