Gene Cpha266_0335 details


Gene Information

Locus tag: Cpha266_0335
Symbol:
ID: 4570533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 365166
End bp: 366539
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 53%
IMG OID: 639764932
Product: tryptophan synthase subunit beta
Protein accession: YP_910818
Protein GI: 119356174
COG category: [R] General function prediction only
COG ID: [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
TIGRFAM ID: [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme
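
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (which the listed values imply) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(365166, 366539)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```

Both results agree with the Gene Length and Protein Length fields of the record.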


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTACAG ATCTCACCAA GATCATTCTT GACGAACATG AAATGCCCCG TCAGTGGTAC 
AACATTCAGG CTGACCTTCC TGTACCATTA CCCCCGCCAG TGGGCATGGA CGGCACCCCG
ATATCTCCCG ACGATCTGGC ACAGGTGTTT CCGATGAACC TGATTGAGCA GGAGATGAGC
ACCGAGCGAT GGATAGACAT CCCTGAGGAG GTGCTCTCGA TCCTGAAACT CTGGCGCCCT
TCGCCGCTCT ATCGTGCAAA GCGTCTCGAA AAAGCCCTCG GAACTCCGGC AAAAATCTTT
TATAAAAACG AAGGTGTTTC GCCTGCGGGA AGTCACAAAC CAAACACGGC TGTTCCCCAG
GCATGGTACA ACAAGCAGTT CGGTATAAAA TATCTTACCA CCGAAACCGG CGCAGGGCAG
TGGGGCAGCG CTCTTGCGAT GAGCTGCAAG CTTATCGGTA TTGAGTGCAA GGTCTTTATG
GTTCGCATCA GTTTCGATCA GAAACCGTTT CGAAAAATCA TGATGAAAAC CTGGGGCGCA
GAGTGCATTG CCAGCCCCAG CCGCGAAACG GCAATCGGAC GACGCATTCT GGAGGAGATG
CCTGATACAC CGGGAAGCCT CGGTATTGCC ATAAGCGAAG CAATCGAACA GGCTGTAGAG
CGTGAGGATA CCCGATATGC TCTCGGCAGC GTGCTGAACC ATGTGATGCT TCACCAGAGC
ATCATCGGTC TTGAAGCTCA AAAACAGTTT GAAAAAATCG GACTCTACCC CGACGTCGTT
ATCGGGTGCG CAGGAGGCGG ATCGAATTTT GCCGGCATCA GTTTTCCTTT CATCTGCGAC
AAGATTCATG GAAAAGATAT TCAGATCATT GCTACCGAGC CCGAAGCCTG CCCGACGCTG
ACCAAAGGCC CCTATATTTA TGATTCCGGA GATGTGGCAT TAATGACTCC GCTGCTCGCC
ATGCACAGTC TCGGTCACGG CTTCATTCCT CCTGCGATAC ATGCCGGAGG GCTCCGCTAT
CACGGCATGG CGCCACTGGT CAGCCATGTC AAACAGCTCG GCCTGATTGA GGCAACCGCA
CTGCCGCAGA CAGAGTGCTA TGAAGCCGCT CTGCTGTTTG CGCACACCGA AGGCTTTATC
CCTGCACCGG AAACCTCTCA CGCTATTGCG CAAACCATCC GTGAAGCTAA AAAAGCAAAA
GAGGAAGGAA AAGAACGGGT TATTCTCATG AACTGGTCCG GCCACGGACT CATGGATCTG
CAAGGCTACG ATGCCTTTCT TTCCGGCAAA CTCAGCGATT ACGCCCTGCC GGATGAGCTT
TTGCAACAAT CACTTGCCGC AGTCAGGAAT CATCCCAAAC CGCCAGGCGC ATAA
 
Protein sequence
MGTDLTKIIL DEHEMPRQWY NIQADLPVPL PPPVGMDGTP ISPDDLAQVF PMNLIEQEMS 
TERWIDIPEE VLSILKLWRP SPLYRAKRLE KALGTPAKIF YKNEGVSPAG SHKPNTAVPQ
AWYNKQFGIK YLTTETGAGQ WGSALAMSCK LIGIECKVFM VRISFDQKPF RKIMMKTWGA
ECIASPSRET AIGRRILEEM PDTPGSLGIA ISEAIEQAVE REDTRYALGS VLNHVMLHQS
IIGLEAQKQF EKIGLYPDVV IGCAGGGSNF AGISFPFICD KIHGKDIQII ATEPEACPTL
TKGPYIYDSG DVALMTPLLA MHSLGHGFIP PAIHAGGLRY HGMAPLVSHV KQLGLIEATA
LPQTECYEAA LLFAHTEGFI PAPETSHAIA QTIREAKKAK EEGKERVILM NWSGHGLMDL
QGYDAFLSGK LSDYALPDEL LQQSLAAVRN HPKPPGA
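
The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, not the database's own method. It assumes that the spaces in the listing are only 10-base grouping, that the sequence contains only A, C, G and T, and that it starts with ATG (alternative start codons permitted by translation table 11 are not handled). Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, so the standard table is used here. All names (`CODON_TABLE`, `clean`, `translate`, `gc_content`) are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGTACAG ATCTCACCAA GATCATTCTT"))  # MGTDLTKIIL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the listed protein sequence and GC content to be compared in the same way.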