Gene Cphamn1_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0356 
Symbol 
ID6374017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp369040 
End bp370431 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content54% 
IMG OID642682874 
Producttryptophan synthase subunit beta 
Protein accessionYP_001958804 
Protein GI189499334 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00422176 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCAG AACCAACAAA GATCCTGTTA CACGAGGATG AAATGCCTCG TCAGTGGTAC 
AACATCCAGG CAGACCTTCC TACCCCTATA CCCTTACCCC TCGGCATGGA CGGCAAGCCG
ATCAATCCTG AAGACCTTGC GCCGGTCTTC CCCATGAACA TTATTGAACA GGAACTCAGC
ACCGAACGCT GGATTACCAT TCCCGAGAAG GTCCGGGAAC TTCTCACTCT CTGGCGCCCT
TCTCCGCTCT ACAGAGCCAA AGGCCTTGAA AAAGCACTGA ATACTCCAGC CAGGATCTAC
TATAAAAACG AAGGTGTTTC ACCAGCCGGG AGCCATAAAC CAAACACCGC TGTCGCCCAG
GCATATTACA ACAAGGAATT CGGCATCAAA TATCTGACAA CCGAGACCGG TGCCGGTCAA
TGGGGAAGCG CGCTTGCAAT GAGCTGCAAG CTTGTCGGCA TCGAGTGCAA GGTGTTTATG
GTGCGCATCA GCTTTGATCA GAAACCGTTC AGGAAAATCA TGATGAAAAC CTGGGGTGCC
GACTGCATTG CGAGCCCGAG CGAAACAACC GGGATCGGAC GTAAAATCCT GGCTGAACAG
CCGGACACTC CCGGAAGTCT CGGCATTGCG ATCAGCGAAG CTATCGAGGA AGCCGTCCAG
CGAGAGGATA CCCGCTACTC TCTCGGCAGC GTCCTGAATC ATGTCATGCT GCATCAGACG
ATCATCGGGC TGGAAGCACA GAAACAGTTC GACAAAATCA ACCGGTACCC GGATATCGTT
ATTGGCTGCG CCGGCGGGGG CTCCAACTTC GCAGGAATCA GTTTCCCGTT CATTCATGAT
AAAATCAACG GAAAAGATCT TCGTGTGATC GCAACCGAGC CCGAAGCCTG CCCGACCCTC
ACAAGAGGAC CGTATGTCTA TGACTCCGGA GACGTAGCGA AAATGACTCC TCTGCTTGCG
ATGCACAGCC TCGGTCACGG CTTTATCCCC CCTTCGATTC ATGCTGGAGG GCTTCGTTAC
CACGGTATGG CCCCACTGGT CAGCCATGTG CTGCAGCAAG GCCTTATTGA AGCAAACGCA
CTGCCGCAGA CTGAATGTTA CAAGGCTGCA CTGCTTTTCG CCCATACAGA AGGGTTCATT
CCCGCTCCGG AAACCTCGCA CGCTATCGCG CAAACTATAC GGGAAGCAAA CCGTGCACGC
GAAGAGGGAA AAGAAAAGAC TATTCTGATG AACTGGTCCG GTCATGGGCT CATGGACCTG
CAGGGATACG ACGCCTTCAT GTCGGGAAAA CTCGAAGACT ATCCTTTGCC GGAAGAACTC
CTGCAACAGT CGCTGGCGGA CATCAAGGAT CATCCGAAGC CACCCGTCTC TCCCTGTCAT
TCCCGTGCCT GA
 
Protein sequence
MSSEPTKILL HEDEMPRQWY NIQADLPTPI PLPLGMDGKP INPEDLAPVF PMNIIEQELS 
TERWITIPEK VRELLTLWRP SPLYRAKGLE KALNTPARIY YKNEGVSPAG SHKPNTAVAQ
AYYNKEFGIK YLTTETGAGQ WGSALAMSCK LVGIECKVFM VRISFDQKPF RKIMMKTWGA
DCIASPSETT GIGRKILAEQ PDTPGSLGIA ISEAIEEAVQ REDTRYSLGS VLNHVMLHQT
IIGLEAQKQF DKINRYPDIV IGCAGGGSNF AGISFPFIHD KINGKDLRVI ATEPEACPTL
TRGPYVYDSG DVAKMTPLLA MHSLGHGFIP PSIHAGGLRY HGMAPLVSHV LQQGLIEANA
LPQTECYKAA LLFAHTEGFI PAPETSHAIA QTIREANRAR EEGKEKTILM NWSGHGLMDL
QGYDAFMSGK LEDYPLPEEL LQQSLADIKD HPKPPVSPCH SRA