Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0522 |
Symbol | |
ID | 4569117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 575861 |
End bp | 577087 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639765121 |
Product | sigma-70, region 4 type 2 |
Protein accession | YP_911003 |
Protein GI | 119356359 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA AAATATGTCC TGTCTGCGGG CTTCCTGTCA CTGAACGATC TCATTGGCAC ATCTATCATC CGGAAGAGGA TTATACGATA ACGTATGAGG CTGTCGGAGA TGATATTCTT CATGGTCAGG CGGTTACAGA TCACGACGTT GTGCTTGATT ATATGGATAA TGAGCTTTTT CAGTCTGTCT GTGATGAGCT TAAGATGAGA GGGACAAAGT TTTCTGTTGT TATCAACCTC AAACACATTC GGGGTGTCAC GCTCGCCTAT AAAAAAGATT TTGCCAATCT TGTCTATAAT TGGGGGCCGA TTTTTACCCG GCTTGTTATT TACAATGTCC ACCCTGATAT TTATTCCATT ATCGAAGGGT TCACCGTTAT CTGTCCTGAA AATGTTGAGG CGGTGATTGT TGATGATTAT CGTGACGCCA TAGCTCGCGC TTCAGTCAGC GAAGATAATC CTGAAACCGA TGGCTGTTTC GGTCAGGAGC CGGTTTATGA TTTTCGTACT GCTTCAAAAA AGGCCTTTCT TGCCGATCTT GCGCAGTTGT TCTGGATCGA TATGCTTGAT CAGCCCGTTT TACTGCCGCC GGCGGAGGAT GATACCTACA TTTTTTTCGG TGCGCTTGAA GAGATGCGTA AAGATATGCT GGCAAAAAAA AAGGAGCACC AGGATGAAGT TGAAAGATTG AAGGAGTCTT ATGAGTTGAA GCAAAAACAC TATCTGATTC AGATGAAATC TCTGATTGAT CAGCATAAAG AGATGATTAC ACAGTTTGAA AAGGAGAAGT TGTTGTTGCA AAACATCCTG AAAGTGAATG CGGCTGAAAT GGTCGTGGCG GGTGAGATAA ATACAACTTC GATCGATGGT CTTACCTCTC TTATTGATTC AGCGGCGATG AGCCAGCCAC TCAAGGAGCC GTTGCTGAAC TCTTGCAAGC GGATAGCTCA AACCGGGAAG ATTGAAAACC AGCATAGCGC CGGGGCGTCT GAAGCTGAAC AGATTTTTCT TTCACTGCTT GAGCAAAAAC ATCCGGGACT CTCATGGAGA GATCGCCGGA TAATCCTGTG GATTAAATCG GATTATAGTA ACAGTGAAAT TGCCGGGTTG ATGGGCATAT CAACGCGCGG CATGGAAAGT ATCCGCTACC GGCTCCATAA AAAGCTTGCG CTGCAGAAAC ATCAGACAAT AAAAAGTTAT CTCTCTTCTC TGGAGAGCGA TGCGTGA
|
Protein sequence | MKQKICPVCG LPVTERSHWH IYHPEEDYTI TYEAVGDDIL HGQAVTDHDV VLDYMDNELF QSVCDELKMR GTKFSVVINL KHIRGVTLAY KKDFANLVYN WGPIFTRLVI YNVHPDIYSI IEGFTVICPE NVEAVIVDDY RDAIARASVS EDNPETDGCF GQEPVYDFRT ASKKAFLADL AQLFWIDMLD QPVLLPPAED DTYIFFGALE EMRKDMLAKK KEHQDEVERL KESYELKQKH YLIQMKSLID QHKEMITQFE KEKLLLQNIL KVNAAEMVVA GEINTTSIDG LTSLIDSAAM SQPLKEPLLN SCKRIAQTGK IENQHSAGAS EAEQIFLSLL EQKHPGLSWR DRRIILWIKS DYSNSEIAGL MGISTRGMES IRYRLHKKLA LQKHQTIKSY LSSLESDA
|
| |