Gene Hhal_1804 details

Gene Information

Locus tag: Hhal_1804
Symbol:
ID: 4711015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1975979
End bp: 1977193
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 71%
IMG OID: 639856274
Product: tryptophan synthase subunit beta
Protein accession: YP_001003370
Protein GI: 121998583
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
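
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(1975979, 1977193)
print(length, protein_length(length))  # 1215 404
```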


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.603761
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACCTG AGCAGGACGG CGGCGGCCCC GCCGAGGAGC ATGATGGGCA CTTCGGCCGG 
TTCGGGGGGC GGTTCGTCTC CGAGACCCTG GTCGGGCCGC TCGAGGAGTT GACCGAGGCG
TACGCCGAGG CGCGCCGCGA CCCGGGTTTC CAGGCCGAGC TCGATCGCGA GTTGCGGGAT
TTCGTCGGTC GGCCCACTCC GCTGTACCTG GCCGAGCGGC TGACCGCCCG CGCCGGCGGT
GCACGCATCT ACTTCAAGCG CGAGGATCTG GCCCATACCG GCGCGCACAA GGTCAACAAT
ACCGTGGGGC AGGCGGTGCT GGCCGCGCGC ATGGGCAAGA CGCGGATCAT CGCCGAGACC
GGGGCGGGGC AGCACGGCGT GGCTACGGCA ACCGTCGCGG CACGCATGGG GCTGGAGTGC
GTCGTTTACA TGGGGGCCGA CGACGTGCAG CGCCAGGCCG CCAACGTCTA CCGGATGCGT
CTGCTCGGCG CTGAGGTCCG CGCCGTGGAC GCCGGGACGC GCACCCTCAA GGACGCCATG
AACGAGGCGA TGCGCGACTG GGTGGCGAAT ATCGACAACA CCTTCTACAT CATCGGCACC
GTCGCCGGTC CCCATCCCTA CCCGACGCTG GTTCGCGACC TGCAGCGGGT CATCGGTGTC
GAGACCCGAG CGCAGATCCT GGAGCGCGAG GGGCGCTTGC CCGACGCCGT GGTCGCCTGC
GTCGGCGGCG GTTCCAACGC CCTGGGGATC TTCCATCCGT TCCTGGATGA CGCCGATATC
CGGCTCGTCG GGGTCGAGGC CGGGGGCGAG GGCCTGGCCT CCGGCCGGCA CGCCGCCCCG
CTCAACGCCG GCCGCCCCGG TGTGCTCCAC GGTGCGCGCA GTTACCTGAT GGAGTCCGAC
GAGGGGCAGA TCATCGGTAC GCATTCGATC TCCGCCGGGC TCGATTACCC CGGCGTCGGC
CCTGAGCACG CGTGGCTGAA GGATTCCGGG CGCGCCGAGT ATGTGACGGT CACCGACGCC
GAGGCCCTGG CCGCTTTCCA CCGGCTCAGC CGCACCGAAG GCATCCTGCC GGCGCTGGAG
ACCTCCCACG CCGTCGCCCA CGCGGAACGC CTGGCCGCGG AACTCGGCCC GGACGCTGCG
CTGGTGGTCA ATCTCTCCGG GCGGGGCGAT AAGGACATCG CCACGGTCGC GGCGCAGGAG
GGCATCGAGC TGTGA
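
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed above.
first_line = "ATGAGACCTG AGCAGGACGG CGGCGGCCCC GCCGAGGAGC ATGATGGGCA CTTCGGCCGG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Applying the same two functions to the full sequence would give a value to compare against the reported GC content.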
 
Protein sequence
MRPEQDGGGP AEEHDGHFGR FGGRFVSETL VGPLEELTEA YAEARRDPGF QAELDRELRD 
FVGRPTPLYL AERLTARAGG ARIYFKREDL AHTGAHKVNN TVGQAVLAAR MGKTRIIAET
GAGQHGVATA TVAARMGLEC VVYMGADDVQ RQAANVYRMR LLGAEVRAVD AGTRTLKDAM
NEAMRDWVAN IDNTFYIIGT VAGPHPYPTL VRDLQRVIGV ETRAQILERE GRLPDAVVAC
VGGGSNALGI FHPFLDDADI RLVGVEAGGE GLASGRHAAP LNAGRPGVLH GARSYLMESD
EGQIIGTHSI SAGLDYPGVG PEHAWLKDSG RAEYVTVTDA EALAAFHRLS RTEGILPALE
TSHAVAHAER LAAELGPDAA LVVNLSGRGD KDIATVAAQE GIEL
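
The protein sequence corresponds to the gene sequence read under translation table 11. As a hedged illustration, the sketch below builds the codon table from the standard amino-acid string (table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which this sketch ignores) and translates the first 30 bases of the gene. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Simplification: alternative start codons are not converted to M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGACCTG AGCAGGACGG CGGCGGCCCC"))  # MRPEQDGGGP
```

The output matches the first block of the protein sequence listed above.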