Gene Haur_0222 details

Gene Information

Locus tag: Haur_0222
Symbol:
ID: 5732117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 258893
End bp: 260092
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 52%
IMG OID: 641277346
Product: tryptophan synthase subunit beta
Protein accession: YP_001543002
Protein GI: 159896755
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
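
The start, end, gene length and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 258893, End bp 260092.
length_bp = gene_length(258893, 260092)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1200 bp and protein length of 399 aa.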


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.580627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGATC ACGCTGTTCT TGATGAATTA AATGGCCGCT ATGGGGATTT CGGTGGACGC 
TATGTGCCAG AAACCTTGAT GGCCGCGATC GAAGAATTAA CCGAAGCCTT TTTTCGGATT
CGCACCGACC CTGAGTTTCA GGCTGAACTC CAACATTTGC ACCAGACCTA TACGGGCCGA
CCAACTGCCC TCACCTATGC CCGCCGCTTG ACCGAGGAAT TGGGTGGTGC TCAAATTTGG
CTCAAACGCG AAGACCTGAC CCACACTGGC GCACATAAAA TCAATAATGC CTTAGGGCAA
GGCTTGTTGG CCAAACGCAT GGGCAAACAG CGGATCATCG CTGAAACTGG CGCTGGCCAG
CATGGCGTTG CTACCGCTGC CGTTTGTGCC CTGCTTGGGC TGCAATGTGT GGTCTATATG
GGCACCGAAG ATATGGAGCG CCAAAAGCCC AATGTCTTTC GTATGCGCTT GCTGGGAGCC
GATGTGCGTG GAGTCAGCAC TGGCTCGAAA ACCCTCAAAG ATGCAGTTAA CGAAGCCATG
CGCGATTGGG TCAGCAACCC CGATTCGTAC TATTTGCTTG GCTCGGCGCT TGGCCCACAC
CCTTATCCAT TGATGGTACG CGAATTTCAA AGCATCATCG GAATTGAAGC CCGCGAGCAA
ATTTTAGCAG CAACTGGCAA ATTGCCCAAC ACGATTATTG CTTGCGTTGG GGGTGGCTCG
AACGCAATCG GGATGTTCCA CGCCTTTATC AACGATGAAC ATGTTGATTT GCGAGGAGTT
GAAGCTGGTG GTCATGGAAT TGAGCTTGGT CGCCATGCAG CGCGGTTTGC AGGCGGGCGC
TTGGGCGTTT TCCAAGGCAC CCGTTCGTAT GTGCTGCAAA ATAGCGATGG CCAAATTGCC
AATACCCATA GCATTTCTGC TGGTCTCGAT TATGCTGCTG TAGGCCCAGA GCACGCTTGG
CTCCACGACG AGGAACGGGC TTTCTATACC TATGCCACCG ACGAAGAGGC CTTGAATGGT
TTTCAAATGC TCTGTCGAAC TGAAGGCATT ATCCCAGCCT TAGAATCGTC GCATGCGATT
GCCGAAGCTG TACGTTTAGC CCCAACCATG AGCAAAGAAA GCATTATTTT GGTCAATCTG
TCGGGGCGTG GCGATAAAGA TATTTTCACC GTTGCAGATG TATTGGGAGT GCAAATGTAG
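
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so its GC fraction need not equal the 52% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence only (sample input, not the full gene).
first_line = "ATGACCGATC ACGCTGTTCT TGATGAATTA AATGGCCGCT ATGGGGATTT CGGTGGACGC"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing all twenty sequence lines, joined into one string, to `clean_sequence` would give the input needed to compare against the GC content listed in the record.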
 
Protein sequence
MTDHAVLDEL NGRYGDFGGR YVPETLMAAI EELTEAFFRI RTDPEFQAEL QHLHQTYTGR 
PTALTYARRL TEELGGAQIW LKREDLTHTG AHKINNALGQ GLLAKRMGKQ RIIAETGAGQ
HGVATAAVCA LLGLQCVVYM GTEDMERQKP NVFRMRLLGA DVRGVSTGSK TLKDAVNEAM
RDWVSNPDSY YLLGSALGPH PYPLMVREFQ SIIGIEAREQ ILAATGKLPN TIIACVGGGS
NAIGMFHAFI NDEHVDLRGV EAGGHGIELG RHAARFAGGR LGVFQGTRSY VLQNSDGQIA
NTHSISAGLD YAAVGPEHAW LHDEERAFYT YATDEEALNG FQMLCRTEGI IPALESSHAI
AEAVRLAPTM SKESIILVNL SGRGDKDIFT VADVLGVQM
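
The protein sequence should correspond to a codon-by-codon translation of the gene sequence. The sketch below is a minimal pure-Python translator, written on the assumption that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which start codons it permits. The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 bases and the last line of the gene sequence are used as input.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon (not emitted)."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence from this record.
head = translate("ATGACCGATC ACGCTGTTCT TGATGAATTA")
tail = translate(
    "TCGGGGCGTG GCGATAAAGA TATTTTCACC GTTGCAGATG TATTGGGAGT GCAAATGTAG"
)
print(head)
print(tail)
```

The two fragments reproduce the first ten residues (MTDHAVLDEL) and the last nineteen residues (SGRGDKDIFTVADVLGVQM) of the protein sequence. The final line can be translated on its own because it starts at base 1141, which is on a codon boundary.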