Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0781 |
Symbol | |
ID | 3748066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1099727 |
End bp | 1101163 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637773311 |
Product | sigma-54 factor |
Protein accession | YP_379090 |
Protein GI | 78188752 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATT TTACGCTTCA GCAAAAGCAA GTACAACATC TATCAGCGCA ACAAATTCTT GGGAGCCAGC TTTTACAGCT TCCAATGCAG CAGCTTGAAG AGCGCATTTA TCAAGAAGTG CAAGAAAATC CCATGCTCGA ATTAGTTGAA GCGCCTCGTG ATGGGCAAAT TGATGGTGTG GTAGCTGAGC CAAGCAATGG AGCCGTGGGG GAGATGTTTG ATTCCATTGA TCGCTTTAGC CGAGCTTCAC TCAATAGTCG TGTTCATAGC GGTAGCCCTT CAAGCGGGCA GGGGGGAAGT GATGATAGCA AAGAGCGCTT TTTTCAAGCG GTGCAGCACG ATAGTTTAGC AGAACTACTG TGTCGCCAAC TTGCTTTGCA AGAGCATATT GGTGAGCGTG AAATGGCAAT TGCCGAAGAG ATTCTCGGCA ACCTTGATAG CGATGGCTAT TTTACCGAGT CCATTGAGCT TATTGTTGCA AGTCTTCAGC AAGCGGAGGT TATTGTTAGT AATGCTGAAG TGGAAGCGGT GCTTCATTCC ATCCATTTTC TTGACCCAGC AGGCATTGCC GTGCGCAATG TGCAACAGCG TTTAATGGTG CAGTTGCAAG TTGCCGCCCA TCGTTATCCA GCCGCAACAT ATAATGTAGC AATGCGCTTG CTTGGCGATT ATTACGACGA CTTTTTAAAT CGTCGGTTAG ATATGCTCCT TAAAAAACTT GGAGTGCCAA AAGCAGAGCT TGAAGCTGCC GTTACGGCTA TTATTGCGCT CGATCTTCAT CCGGGTGTTT TTTACGATGA GGGTGGGCAT TACATTAGCC CCGATGTTAT TGTTACCTAC GAAAATGGTG AATTAACAGC GGCATTAAAT GACCGAAGCG CTCTCTCGGT TAAAGTAACC GATCGCTATC GTGAACTGCT TGCAAATCGT AAAGCGCCGA AAGAGGAGAA GCAATTTATT CGCCACAACA TTCAGCGTGC TCAAGATTTT GCAACAGCCT TAGCCATGCG CCGCCAAACC CTTTTAAAAG TAATGGAAGC GCTTTTAAAG CAGCAATATG CTTTTTTTGT TTCGGGACCT GAGCATGTTG TACCACTTGG CATGAAAAGT GTTGCTGAAG AGACGGGGCT TGATATTTCC ACCATTAGCC GTGCGGTAAA TGGCAAATAT GTACAAACTC GCTTTGGAGT CTTTGAATTG CGTTACTTTT TTGGAAGTGC ACTCTCAACC GATGAGGGCG AAGAGCTTTC AAGCAAAATT ATTCGTCAAC ATCTTGCCGA AATAATTAAA GCCGAAGATT CAGCCCATCC ATTAAGCGAT GACACGCTTG CCGAAATGCT GGTGAGTAAA GGTATTCGTA TTGCTCGCCG AACGGTTGCA AAATACCGTG AACAAATGCA AATTCCCGTT GCAAGATTAA GAAAAAAAAT ATTTTAA
|
Protein sequence | MADFTLQQKQ VQHLSAQQIL GSQLLQLPMQ QLEERIYQEV QENPMLELVE APRDGQIDGV VAEPSNGAVG EMFDSIDRFS RASLNSRVHS GSPSSGQGGS DDSKERFFQA VQHDSLAELL CRQLALQEHI GEREMAIAEE ILGNLDSDGY FTESIELIVA SLQQAEVIVS NAEVEAVLHS IHFLDPAGIA VRNVQQRLMV QLQVAAHRYP AATYNVAMRL LGDYYDDFLN RRLDMLLKKL GVPKAELEAA VTAIIALDLH PGVFYDEGGH YISPDVIVTY ENGELTAALN DRSALSVKVT DRYRELLANR KAPKEEKQFI RHNIQRAQDF ATALAMRRQT LLKVMEALLK QQYAFFVSGP EHVVPLGMKS VAEETGLDIS TISRAVNGKY VQTRFGVFEL RYFFGSALST DEGEELSSKI IRQHLAEIIK AEDSAHPLSD DTLAEMLVSK GIRIARRTVA KYREQMQIPV ARLRKKIF
|
| |