Gene Information |
Locus tag | Sare_2066 |
Symbol | |
ID | 5703277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2365863 |
End bp | 2366879 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271552 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001536923 |
Protein GI | 159037670 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
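
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2365863, 2366879)
print(length, protein_length(length))  # 1017 338
```

Under those assumptions the result agrees with the listed Gene Length (1017 bp) and Protein Length (338 aa).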
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.623235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.45327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAAGAC GACGAGTTGT TGTGGTGGTC TACCCCGAAG TCCAGGCACT GGATGTCACC GGCCCGGTGG AGGTGTTCGA CACCGTGAAC CGGTTCCTGC CGGACCCGGC GGCGGGGTAC CGGATCGAGT ATGTCTCGGC TGCGGCCCCC TTGGTGCGTA CCTCGGCGGG TTTGATGATC CAGGCTGCGC CGTTGGAGAC AGGCGAAGGG CGAATGGACA ACCTGCTGGT ACCGGGCGGC TGGGGTCTGG GCCAGGCGCT CGCGGACCAC GATCTGCTGT GCTGGATCCA ACGGGCCGCG AAACGGGCGC AGCGGGTCAC GTCAGTGTGC GGCGGGTCCT TTCTGCTGGC CGAGGCCGGG CTGCTCGACG GCCGTCGGGC GACGACACAC TGGGCGTACT GCCAGGACAT GGCCCGGCGA TATCCGGCGG TGACTGTCGA CCCCGAACCC ATATACGTGT GGGACGGACC CTACGTGACA TCGGCGGGAG TGACTGCGGG AATCGACATG GCCCTGGCGC TGGTGGAGGC CGACCATGGT GCCGAGTTCG CCCTGGAGAT CGCCCGCTAC CTGGTGCTCT TCTTCAAACG CGACGGCGGC CAGCCGCAGT TCAGTGGCAT GCTGGACGCA CAACTGGCTG ACCGGGTGCC GATCCGAACC GCCCAAGAGT GGGTGCGGGC CCACGTCGAG CACCCACTTC CGGTGCCGGA ACTCGCCGAG CGGGTGCACA TGAGCCCCCG GAACTTCTCC CGGGTGTTCC GGCGAGAGGT CGGCATGACA CCCGGACAGT ATGTCGTCCA GACGCGCGTC GGTCGAGCCC GGGAACTGCT GGAGAGTACC GACCTGTCCA TCAGCCAGAT CGCCCGCCGA TGCGGCTTCG GTCGGGTGGA GACGTTCCTA CGGACGTTCG ATCGCGCGGT GGGTCTGACG CCGGGAGCCT ATCGGCAACG GTTCCAGGTC CTGGCACCAG CCGGTCTGCT GATCGAGCCA CCGGTAGCGG CGGGGAGCCA GGGGTGA
|
Protein sequence | MSRRRVVVVV YPEVQALDVT GPVEVFDTVN RFLPDPAAGY RIEYVSAAAP LVRTSAGLMI QAAPLETGEG RMDNLLVPGG WGLGQALADH DLLCWIQRAA KRAQRVTSVC GGSFLLAEAG LLDGRRATTH WAYCQDMARR YPAVTVDPEP IYVWDGPYVT SAGVTAGIDM ALALVEADHG AEFALEIARY LVLFFKRDGG QPQFSGMLDA QLADRVPIRT AQEWVRAHVE HPLPVPELAE RVHMSPRNFS RVFRREVGMT PGQYVVQTRV GRARELLEST DLSISQIARR CGFGRVETFL RTFDRAVGLT PGAYRQRFQV LAPAGLLIEP PVAAGSQG
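
The gene and protein sequences can be related with a short script. The sketch below is a hedged Python example, not the database's own pipeline: it strips the 10-character block spacing, computes the GC fraction, and translates the CDS. It assumes the codon-to-residue assignments of NCBI translation table 11 (the same as the standard code) and treats the first codon as Met when it is one of the alternative initiation codons listed for that table, which is why the GTG at position 1 reads as M. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Assumption: table 11 shares
# the standard code's assignments and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons listed for table 11; read as Met at position 1 only.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        # Raises ValueError on ambiguous bases such as N.
        index = (BASES.index(codon[0]) * 16
                 + BASES.index(codon[1]) * 4
                 + BASES.index(codon[2]))
        aa = AMINO_ACIDS[index]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "GTGTCAAGAC GACGAGTTGT TGTGGTGGTC"
print(translate(first_blocks))  # MSRRRVVVVV
```

The first three blocks of the gene sequence translate to the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein sequence.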