Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2157 |
Symbol | |
ID | 5705613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2480680 |
End bp | 2481720 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271642 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001537013 |
Protein GI | 159037760 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.256833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00609835 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAGCGC ACGTCATTGC GGTGGCGGTG ACCGACAACC TGCCTATCTT CGAGCTGGCC GTGCCGCAGG AGGTGTTCGG CACCGACCGC CGGGACATCG CCGATCCGTG GTACGACATG CGCCTGTGCG CGGCCGAACC GGGCCCACTG CGCACCACCG GAGGCGGCTT CCTCAACCCG TCGTACGGCT TGGACGATCT GGTCGAGGCA GACACCGTGC TGGTGCCGGC GTGCGCGCGG GCAGCCCAGG TCAACCCACC GGCCGACCTG GTCGAGGCAC TCCGGGTGGC GCACGCGCGG GGCAAGCGGA TTGTCGGCAT CTGCACCGGC GCGTACGTGC TGGCCGCGGC CGATCTGCTC GACGGCCGCC GGGCAACGAC CCACTGGATG AACGCCCAGG ACTTCGCGGC CCGGTTTCCC CTGGTCGACC TCGACCCTCG GGTGCTCTAC GTGGATGAGG GCGACATCCT CACCTCCGCC GGGACGGCCG CCGCGATCGA TCTGTGCCTG CACCTGGTGT GGCGGGACCA CGGCGCGGCG ATCGCCCACG AGGTCGCCCG CCGGATGGTC GTGCCCCCGC ATCGGGGCGG TGAGCACACC CAGTACCCGT CCGCACCAGC GCGGAGCGTG CCCCCCGACG ACCTGAGCGC GGTGCTGGAA TGGGCCCGCG GCCGGCTTGA CCAGCCACTG ACGGTCAACG ACCTGGCGCG TGCGGCGAAC CTGAGCCCGC GTACGTTCGC CCGGCGGTTC CGCGACACGC TCGGGACCAC TCCGTTGCAG TGGCTACTGG AGCAGCGGGT CCGGCTGGCT CAGGAACTGC TGGAGACCAC GGACGAGCCG GTGGAACGGA TAGCTCACCG CACCGGCTTC GGTACGGGCG CCAACCTGCG CCAGCACTTC GGTCGGGTCA GCGGGGTGAC CCCCCAGTCC TACCGGCACG TGTTCCGCTA CCGCAACGCC GCGGCGGCAT CGCCGGTTGT CCACGACACC TCGGAGCATC CGGCGTTGGT GATCGCCCGC TCGGGCGGCG AGGCGAGGTG A
|
Protein sequence | MGAHVIAVAV TDNLPIFELA VPQEVFGTDR RDIADPWYDM RLCAAEPGPL RTTGGGFLNP SYGLDDLVEA DTVLVPACAR AAQVNPPADL VEALRVAHAR GKRIVGICTG AYVLAAADLL DGRRATTHWM NAQDFAARFP LVDLDPRVLY VDEGDILTSA GTAAAIDLCL HLVWRDHGAA IAHEVARRMV VPPHRGGEHT QYPSAPARSV PPDDLSAVLE WARGRLDQPL TVNDLARAAN LSPRTFARRF RDTLGTTPLQ WLLEQRVRLA QELLETTDEP VERIAHRTGF GTGANLRQHF GRVSGVTPQS YRHVFRYRNA AAASPVVHDT SEHPALVIAR SGGEAR
|
| |