Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1029 |
Symbol | |
ID | 3833490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 1217884 |
End bp | 1218939 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637825118 |
Product | AraC family transcriptional regulator |
Protein accession | YP_426117 |
Protein GI | 83592365 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.221048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCATA GCCTGGACTC ACGGCACATG TGCCAGAATC GCCTATCCTT TGGGCAACGC AATATCCTGG CGGCGGCGGC TTGCGGCATC GCCCAGGGCA TTCGCGCCCA GGGGGCCGAT GCCGACGCGG TGTTCATCAA GGCGGGGGTG CGCGAAAGCG ATCTGGGCGA TCCCCGGCTG TCGCTCGACC TGGGATCTTA TGTGGCGATG TTCGAACTGG CCGCCGTCGC CACCGGCAAC GACAATTTCG GCCTATGGTT CGGTCAGGGC TTTCTGCCGC CGATGCTCGG GCTGATTGGC GAGATCGCCC TGTGCTCGCC GACCCTGGGC AGCGCCCTTG ATAACCTCGC CACGCTTTTT CCCTTCCATC AGCAGGCCAC CCAAACCCGC CTGCGCCGCG ACGGAACCCT GCTCCGCTTG GAATACCGCA TCCTTGATGG CCGGATCATC GACCGCCGCC AGGATGCCGA ACTGACCATG GGCATGTTCG CCAATGTGCT GCGCGCCGCA TTGGGGCCGG GCTGGCGTCC CGAGGAGGTG CATTTCGAGC ACCCCCGGCC CGAGGGATGG GCCGCCCACG GCCGGGCCTT CGACGCCGAT ATCCATTTCG GCCAACCGAC CAACGCCCTG GTCTTTCGCG ACCGCGACCG CGAGCGCCCG ATGCCCGCCG GCGACCTCGG CCGCCTGACC CGCCTGCGCG ACGAGTTACT CAGCGTCAGC GGCGGCACCG GCCGGGTTCC CTTCGTCGAA CAGGTGCGCG GCGAAACCCG CCGCCTGCTG ACCGAAGGCG CCCCCCATAT CGAGGATGTG GCCGAGGCCC TGGGTCTGGC GCGCTGGACC CTGCAGCGCC GGCTGGCCGA CGAGGGGTTG AGCTTTTCCG ATGTGGTCGA CGACCTGCGC CGCACCTTGG CCAAACGCTA TGTCAGCCAG CCCCATGTGC CCTTGGCCGA TATCGCCCAA TTCCTCGGCT ATTCCGAACC CAGCGCCTTC TCCCGCGCCT TCGTCCGCTG GTTCGGCATC TCCGCCCAAC AGATGCGCCG CGCCGAAGCG GCTTAG
|
Protein sequence | MSHSLDSRHM CQNRLSFGQR NILAAAACGI AQGIRAQGAD ADAVFIKAGV RESDLGDPRL SLDLGSYVAM FELAAVATGN DNFGLWFGQG FLPPMLGLIG EIALCSPTLG SALDNLATLF PFHQQATQTR LRRDGTLLRL EYRILDGRII DRRQDAELTM GMFANVLRAA LGPGWRPEEV HFEHPRPEGW AAHGRAFDAD IHFGQPTNAL VFRDRDRERP MPAGDLGRLT RLRDELLSVS GGTGRVPFVE QVRGETRRLL TEGAPHIEDV AEALGLARWT LQRRLADEGL SFSDVVDDLR RTLAKRYVSQ PHVPLADIAQ FLGYSEPSAF SRAFVRWFGI SAQQMRRAEA A
|
| |