Gene Information |
Locus tag | Cpha266_0401 |
Symbol | |
ID | 4568656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 445899 |
End bp | 446879 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765001 |
Product | KpsF/GutQ family protein |
Protein accession | YP_910884 |
Protein GI | 119356240 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation; [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
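The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the untranslated stop codon


# Values taken from the record above.
length = gene_length_bp(445899, 446879)
print(length, protein_length_aa(length))
```

With the values in this record, the result agrees with the listed Gene Length (981 bp) and Protein Length (326 aa).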
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTCAC AACCAGAAAC CGCCGCAATA ATTGACTCGG GAAAAAATAT TCTTGAACAG GAAGCGCAAG CAATCCATCG GATCGCAGAC AGGCTCGATG ATAATTTCGC CCGTGCCATT GCATTGATTC TCTCTTGTAA AGGCAAGATC ATTGTCTCCG GAATGGGCAA ATCCGGAATT ATCGGGCAGA AAATTGCGGC GACAATGGCA TCAACCGGAA CAACGGCTCT TTTTCTGCAC CCTGCTGACG CAGCACACGG AGATCTCGGC ATCGTTTGTT CTGGAGATAT TGTCATCTGC CTTTCAAAAA GCGGAACAAC CGAGGAGCTC AATTACATTA TACCGGCACT GAAAAAAACC GGAGCGTCAA TCATTGCCCT GACCGGCAAC AGCCGATCCT ATCTTGCAAA GAGCGCCGAT ATCGTTCTTG ACACTGGTAT CGAGCAGGAA GCCTGTCCTT ATGATCTTGC GCCGACAACA TCAACAACGG CGATGCTTGC CATGGGCGAT GCCCTATCCA TGACACTGAT GCAGGCAAAA AACTTCACCC CCGTTGATTT CGCGCTAACC CATCCAAAAG GATCACTTGG ACGAAGACTG ACCATGAAAG TATCGGATAT CATGGCTTCC GGCGATACCA TGCCTGTGGT TAATGAAGAT GCAGCTGTCA CCGATCTGAT TCTTGAAATG ACCTCAAAAC GCTACGGAGT CAGCGCCATT ATCAACAAGA AAGGGGTATT GACCGGCATT TTTACCGACG GCGACCTTCG TCGGCTTGTC CAGAAAGGTG ACGATTTTCT GAACCTGACA GCGAGGTCGG TCATGACGGC AAACCCAAAA ACCGTTGGAG CAGAAAGGCT TGCAACCGAG TGCCTCGAAA TTCTCGAGAC CTATCGCATT ACACAGCTCA TTGTTTGTGA TATTGATCAG CGTCCTGCAG GCATTATCCA TATTCATGAC CTGATTTCCC TCGGGCTGTA G
|
Protein sequence | MISQPETAAI IDSGKNILEQ EAQAIHRIAD RLDDNFARAI ALILSCKGKI IVSGMGKSGI IGQKIAATMA STGTTALFLH PADAAHGDLG IVCSGDIVIC LSKSGTTEEL NYIIPALKKT GASIIALTGN SRSYLAKSAD IVLDTGIEQE ACPYDLAPTT STTAMLAMGD ALSMTLMQAK NFTPVDFALT HPKGSLGRRL TMKVSDIMAS GDTMPVVNED AAVTDLILEM TSKRYGVSAI INKKGVLTGI FTDGDLRRLV QKGDDFLNLT ARSVMTANPK TVGAERLATE CLEILETYRI TQLIVCDIDQ RPAGIIHIHD LISLGL
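The gene sequence above is shown in space-separated blocks of 10 bases and already reads in the coding direction (it begins with ATG and ends with TAG). As a minimal sketch, the blocks can be joined and translated codon by codon. The codon assignments used here are the standard ones, which translation table 11 shares; table 11 differs only in which start codons it permits, and that does not matter for a gene starting with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace between blocks.

    Translation stops at the first stop codon; the stop is not emitted.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGATCTCAC AACCAGAAAC CGCCGCAATA"))  # MISQPETAAI
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to `translate` allows the same comparison for the whole protein.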
|
| |