Gene Information |
Locus tag | Rcas_1054 |
Symbol | |
ID | 5538520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1368865 |
End bp | 1369800 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640893190 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001431173 |
Protein GI | 156741044 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
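
The coordinate and length fields in the table above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1368865
end_bp = 1369800

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 936 311
```

Both results agree with the reported Gene Length (936 bp) and Protein Length (311 aa).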
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.597722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGA TTGCCGATCG CCAGGCGGAG CGGCGCGCGC GCGAGGCGCA GTCTCATCGG GAGGAGTTAG TCGAGCGCAT TGCGCGGGCG ATTCGTCAGG ATGGGGAGAT CGAGCCATTG CCGGGGCTTC ATTTCTTCCG CCTTTCCGCC ACGCGCGGGC TGGTGCACAG CGTGAGTCGC CCCGCCTTTT GCATCATCGC CCAGGGGAGC AAGGAACTCT TTCTTGGCGA CCGCCACTAC CGCTACGACC CCTATCACTA CCTGCTTGTC ACTGTCGATC TGCCTGGCGT CAGTCGCGTG CTGGAGGTGT CGCCGACGCG CCCCTACCTC AGCCTGCGCA TCGACCTGTC GCCGGCGCTG GTTGGCGCAG TCATGGCGGA GAGTGGGTAC ACCGCCCCAC CGAGCGTCGC CAGAGTTGGC GCGGTGGATG TCAGTTTGCT GGATGGGGAG TTGCTCGATG CTGTCGTGCG GTTGGTGCGG CTCACCGAGG CGCCTGCCGC TGACGTACAG GCGCTCAAGC CGCTGATTGT GCGAGAAATC GTCTATCGTC TGCTGGTGGG CGATCAGGGC GCGCGGTTGC GCCATCTGGC GGGTGTGGGA GGCTCCATCT CCCACATTGC GCGCGCCGTA GAGTGCATCC GGCGGAACTT CCACCAGCCT CTGCGCATCG AGCAACTGGC GCGCGAGGCG GGCATGAGCG TCTCTGGCTT TCACCACTAT TTCAAAGCCG TCACCGGCAT GACGCCGCTG CAATTTCAGA AGCAACTGCG CCTGCACGAA GCGCGACGAC TCATGATTGG GGAAAACCTG CCCGCGATCG ACGCCGCCTA CCGCGTCGGC TACCAGGACG CCTCCCATTT TAACCGGGAG TACAAACGGC TCTTCGGTGC GCCGCCGCTG CGCGATGTGC AACGGCTGCG CGACGCAATG GCGTGA
|
Protein sequence | MQQIADRQAE RRAREAQSHR EELVERIARA IRQDGEIEPL PGLHFFRLSA TRGLVHSVSR PAFCIIAQGS KELFLGDRHY RYDPYHYLLV TVDLPGVSRV LEVSPTRPYL SLRIDLSPAL VGAVMAESGY TAPPSVARVG AVDVSLLDGE LLDAVVRLVR LTEAPAADVQ ALKPLIVREI VYRLLVGDQG ARLRHLAGVG GSISHIARAV ECIRRNFHQP LRIEQLAREA GMSVSGFHHY FKAVTGMTPL QFQKQLRLHE ARRLMIGENL PAIDAAYRVG YQDASHFNRE YKRLFGAPPL RDVQRLRDAM A
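
The gene and protein sequences above can be cross-checked by translating the DNA and comparing the result with the listed protein. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order. Assumption: table 11 shares these
# codon-to-amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
gene_prefix = "ATGCAGCAGA TTGCCGATCG CCAGGCGGAG"
print(translate(gene_prefix))  # MQQIADRQAE
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.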