Gene Information |
Locus tag | Rcas_2376 |
Symbol | |
ID | 5539857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3062419 |
End bp | 3063447 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894508 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001432476 |
Protein GI | 156742347 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
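
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 3062419
end_bp = 3063447
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which encodes no residue,
# so the protein is one residue shorter than the codon count.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1029 0 342
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.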
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.912933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.295563 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTGGA CGGTCTATGA TAATTCGTCC ATTTCTTTTT GCCTTTGGTC CAGAGTTATG GATTCGCTCT CTCACGTTTT GAACACGGTG CGCTTGAGAA GCAGTGTATA CTGCCGCTCT GAACTTGGTT CGCCGTGGGG GTTGCACTTT GCGCCACGTT CGTGCGCCGT CTTTCATGCT CTGCACCGGG GAAACGGCTA TCTCTGCGTG GAAGGCGACG CCGGTTTGCT GCCGTTGCGC GAGGGAGATG TCGTGCTCCT TCCAGGGGGT GAGGAGCATT CCATTCTGGA AACGCCCGAT GCGCCGCTCT TTCGCAACCT GGAACTCGAT CAGTGGGGCG AGTGCGCCTT GATGCGCTGG AGCGACGCTC CCACAGCGGT GATCCTGTGC GGGACGTTCG ATTTTGAGCA TATCGGCACA TACGCATTAC TGAAACATCT CCCGCGTGTG GTTCACATCC CGCGCAGCGA AAACAGCGCA CTCAACAGCA TTCTGGCGCT GATGGCTTCC GAAGCGGAAG CCGGACGACC GGCCAAAGAA GTGGCGCTGC GCCGCCTGGC GGATATTCTG TTCATTCAGA TTATTCAACG CTGGGTCGAA ATTGAAGGGA TAGAACGCTG CGGTTGGTTC GGCGCCCTGC ACGACCCGCT GATCGGCAGG GCGCTGGAAC TGATCCATGC GCAGCCTCAA CATTCCTGGA CGGTTGCTGC GCTGGCGCGC GCCGTCGCCT GTTCGCGCTC ATTCTTTGCG GCGCGTTTCA CAGCGCTGGT GGGAGAACCG CCGATGGAAT ATCTCCGGCG CTGGCGTCTG CAACTGGCAA CCCACCTGCT GATGGACCAT GCCCACGTCA GCGCTGGCGA CATCGCCGCG CGGATCGGGT ATCATTCCGA AGCAGCGTTT AGCAAAGCCT TCAAACGCTC GTTGGGCATC GCGCCAGGAG CGTACCGGAA GCGCCATAGC GCCGCGAGGT CGTCAACCGA TGCAAGAGAC GTCAATGGTA TAATGAACCG CAACAGCGCC AACCGCTGA
|
Protein sequence | MVWTVYDNSS ISFCLWSRVM DSLSHVLNTV RLRSSVYCRS ELGSPWGLHF APRSCAVFHA LHRGNGYLCV EGDAGLLPLR EGDVVLLPGG EEHSILETPD APLFRNLELD QWGECALMRW SDAPTAVILC GTFDFEHIGT YALLKHLPRV VHIPRSENSA LNSILALMAS EAEAGRPAKE VALRRLADIL FIQIIQRWVE IEGIERCGWF GALHDPLIGR ALELIHAQPQ HSWTVAALAR AVACSRSFFA ARFTALVGEP PMEYLRRWRL QLATHLLMDH AHVSAGDIAA RIGYHSEAAF SKAFKRSLGI APGAYRKRHS AARSSTDARD VNGIMNRNSA NR
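
The gene and protein sequences above are displayed in blocks of 10 characters. The following Python sketch shows one way to strip that grouping and translate the nucleotide sequence so the two rows can be compared. It is an illustration under stated assumptions, not the database's own method: the helper names `ungroup` and `translate` are hypothetical, the codon-to-residue mapping used is the one shared by the standard code and translation table 11, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 assigns the same residues as the standard code;
# its alternative start codons are NOT handled in this sketch.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(sequence: str) -> str:
    """Remove the display spacing (blocks of 10) and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def translate(grouped_dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = ungroup(grouped_dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence row.
print(translate("ATGGTCTGGA CGGTCTATGA TAATTCGTCC"))  # prints: MVWTVYDNSS
# Last nine bases of the gene sequence row (two codons plus the TGA stop).
print(translate("AACCGCTGA"))  # prints: NR
```

The two fragments reproduce the first block (`MVWTVYDNSS`) and the final residues (`NR`) of the protein sequence row. Passing the full gene sequence row to `translate` would allow the whole protein to be compared in the same way.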