Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4163 |
Symbol | |
ID | 5707712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4727708 |
End bp | 4729156 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273590 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001538943 |
Protein GI | 159039690 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.813535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00229992 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGACCTGG ACTTCGAGCG GTGTTATCGG GCCGTCGACA GCCGTGACCA GCGGTTTGAC GGCTGGTTCT ACACGGGCGT GACCTCCACC GGCATCTACT GTCGGCCGTC TTGTCCGGCG ATCACTCCGA AACGGGAGAA CATCCGGTTC TTTCCGTCGG CCGCCGCAGC GCAGGAGGCC GGGCTTCGGG CCTGCCGTCG GTGCCGGCCG GATGCGACCC CGGGCTCACC GCACTGGGAC GTCCGCGCCG ACGTGGTCGG TCGCGCCATG CGACTGATCG CCGACGGCGT GGTCGACCGG TCCGGGGTAC CCGGCCTGGC GGCACAGCTC GGCTACACAG AGCGGCACCT GCACCGGATG CTCCGCACCG AACTGGGGGC CGGCCCGCTC GCGCTGGCCC GCGCGCAGCG CGCGCAGACC GCGCGGACCC TGATCGAAAC CACCGACCTC GGAATGGCGG AGATCGCGTT CGCCGCCGGG TTCGGCAGCG TTCGGCAGTT CAACGACACG GTCCGCGAGG TGTACGCGGT TGCCCCGTCC GAGCTTCGAG CGGTCCGGAG CCGACGGACG TCCCCCGCCG GACCCGGAAC GATCACCGTA CGGCTGGCGT ATCGGCCCCC ACTGCATGTC GGGGCGCTGC TGGACTTCCT CGCCCCGCGG GCGCTGCCCG GTGTCGACGA GGTGCGCGCG GGGGCCTATC ACCGCGGCCT GCGGCTGCCA CACGGCACCG GCGAGGCTTC GCTGACTCCG ACGGACAGGC ACGTGGAGGC GACCCTACGC CTGTCCGACC TGCGGGACCT GGCGCCGGCG GTGGCCCGCT GCCGCCGGCT GCTCGACCTC GACGCCGACC CGACGGCAGT GGACGCCGTC CTGGCCACCG ACCCCGCCCT GGCCGCCGTG GTCACGGCGG AGCCCGGAGT CCGGGTGCCG CGCGCGGTCG ACGGCTTCGA GGTGGCCGTC CGCGCGGTCA TCGGCCAGCA GGTCTCGGTG GCGTCCGCCC GCACCACCCT CACCCGTCTC CTGAACGAGC TACCCACCCT GGCCGACAGG TCGGATGGGG TGACCGGTGG ATTGCACGCG TTTCCCTCCG CCGAGGAGGT GCGCAACGCG CCGGACTCAG CATTCCGGAT GCCGGCCGCC CGCCGGGAGA CGCTGCGCCG GCTCGCGGGG GCGGTTGCCG CCGGGGAGCT CGACCTGGAA CCGGGTGGGG ATCGGAAGGA GACCCGGCAA CGGCTGCTGG CGCTGTCGGG CATCGGCGCG TGGACGGCGG ACTACATCAC GCTCCGCGCG TTGGGCGACC CGGACGTGTT CCTTCCCACC GACGTTGCCG TCCGCCGGGG CGCTGCCGCC CTCGGTCTAC CTAGCACCCC GGACACCCTG CACACGTACG CCGACCGCTG GCGCCCCTGG CGCTCATACG CGGTGAGCCG ACTTTGGAGA GCAGCATGA
|
Protein sequence | MDLDFERCYR AVDSRDQRFD GWFYTGVTST GIYCRPSCPA ITPKRENIRF FPSAAAAQEA GLRACRRCRP DATPGSPHWD VRADVVGRAM RLIADGVVDR SGVPGLAAQL GYTERHLHRM LRTELGAGPL ALARAQRAQT ARTLIETTDL GMAEIAFAAG FGSVRQFNDT VREVYAVAPS ELRAVRSRRT SPAGPGTITV RLAYRPPLHV GALLDFLAPR ALPGVDEVRA GAYHRGLRLP HGTGEASLTP TDRHVEATLR LSDLRDLAPA VARCRRLLDL DADPTAVDAV LATDPALAAV VTAEPGVRVP RAVDGFEVAV RAVIGQQVSV ASARTTLTRL LNELPTLADR SDGVTGGLHA FPSAEEVRNA PDSAFRMPAA RRETLRRLAG AVAAGELDLE PGGDRKETRQ RLLALSGIGA WTADYITLRA LGDPDVFLPT DVAVRRGAAA LGLPSTPDTL HTYADRWRPW RSYAVSRLWR AA
|
| |