Gene Information |
Locus tag | Cpha266_1943 |
Symbol | |
ID | 4570057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2253049 |
End bp | 2254248 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639766525 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_912383 |
Protein GI | 119357739 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.233821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCCGT TTCACTATTC AGTTCCTGAT TCAAAAGGGC ATTTCGGTAA ATTCGGCGGG AAATTTATCC CGGAAACCCT TATAAAAAAC GCAGCCGATC TTGAGCTCGA ATACAGCCGG GCAAAGAATG ACCCGGTTTT TCAGACAAGG CTTGCAACGC TGCTGCGTGA CTATGTCGGA AGACCTACCC CGCTCTACCT TGCCGAAAGA TTGAGCGGAA TGCTCCAGGG AGCGCGCATC TACCTGAAAC GTGAAGACCT CTGTCACACA GGCGCACACA AAATCAATAA TGCGCTCGGG CAGGTGCTGC TCGCTGAAAG AATGGGCAAA AAACGGGTTA TCGCCGAAAC CGGAGCCGGC CAGCACGGCG TAGCCACCGC TACGGTCTGC GCCCTGTTCG GCATCTCATG CGTTGTCTAC ATGGGCGAAG AAGATATCCG CCGCCAGTCG CCGAATGTTG CTCGAATGAA ACTGCTCGGC GCTGAAGTCA GACCTGTCGG TTCAGGCTCG AAAACACTGA AAGACGCAAC AAGCGAAGCG ATACGGGACT GGATGAACAA CCCTGAAGAT ACCTTTTACA TTATCGGCTC CGTTGTGGGT ATGCACCCTT ATCCCATGAT AGTAAGGGAT TTTCAGTCGG TCATCGGCCG CGAAACCAGA ACGCAGGTTC TCGAACAGGC AGGAAAGCTC CCCGATGTCA TCACAGCCTG CGTCGGTGGA GGAAGCAATG CCATAGGAAT CTTTCATGAA TTTCTTCCGG ACGTTCCCGG CGTTGAACTG GTTGGCGTTG AAGCAGCCGG AGAAGGACTT CAGGGACGTC ATGCCGCATC ACTCACCATG GGTAAAACCG GCGTACTGCA CGGCGCCATG ACAAAACTGC TGCAGGATGA AGACGGCCAG ATCCTTGAAG CGCACTCCAT TTCAGCCGGA CTCGATTATC CGGGAGTAGG ACCTGAACAC TGCTATCTGC AACGAAAAGG TCTGGTCAGC TATACATCGA CTACAGACAA AGAAGCACTT GAGGCGCTTA AAACACTCGC AGCAACCGAG GGAATCATCT GCGCGCTTGA ATCAGCCCAT GCCGTACACT ATGCCATAAA AAGAGCGCCA GAGATGCCGA AAGATGCCAT TCTTGTTGTA AACCTCTCCG GAAGGGGCGA CAAGGATATG GAAACAATAA TGCAGAGTAT CCCTCTCTGA
|
Protein sequence | MVPFHYSVPD SKGHFGKFGG KFIPETLIKN AADLELEYSR AKNDPVFQTR LATLLRDYVG RPTPLYLAER LSGMLQGARI YLKREDLCHT GAHKINNALG QVLLAERMGK KRVIAETGAG QHGVATATVC ALFGISCVVY MGEEDIRRQS PNVARMKLLG AEVRPVGSGS KTLKDATSEA IRDWMNNPED TFYIIGSVVG MHPYPMIVRD FQSVIGRETR TQVLEQAGKL PDVITACVGG GSNAIGIFHE FLPDVPGVEL VGVEAAGEGL QGRHAASLTM GKTGVLHGAM TKLLQDEDGQ ILEAHSISAG LDYPGVGPEH CYLQRKGLVS YTSTTDKEAL EALKTLAATE GIICALESAH AVHYAIKRAP EMPKDAILVV NLSGRGDKDM ETIMQSIPL