Gene Hhal_0124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0124 
Symbol 
ID4710621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp142472 
End bp143611 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content68% 
IMG OID639854582 
Productthreonine synthase 
Protein accessionYP_001001720 
Protein GI121996933 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTTCC GACCCCGTTA CACCGGACTG ATCGCCAAGT ACCTGGACCG GCTGCCGATC 
AGCGACGACG CGCGCATCCT CGGGCTGGGT GAGGGGAATA CGCCGCTGAT CCAGCTGACC
CGCATCCCGG CGGAGCTCGG ACGGGACGTG GATCTCTACG TCAAGTTCGA GGGACTCAAC
CCGACGGGGT CGTTCAAGGA TCGGGGCATG ACCATGGCGG TCACCAAGGC CGTCGAGCAG
GGCGCCAAGG CGATCATCTG TGCCTCGACC GGCAACACCT CCGCTTCGGC GGCCGCCTAT
GCAGCCCGCG CCGGGATTAG CTGCTTCGTG CTCATTCCCG ATGGCAAGAT TGCCATGGGC
AAACTGGCTC AGGCGATCAT GCACGGCGCT CAGGTGCTGC AGATCCGCGG CAATTTCGAC
GCCGGCATGC GGCTGGTCAA GGAGCTGGCC GAGCACGCGC CCCTGACGAT CGTCAACTCC
ATCAATCCGT ACCGCCTGCA GGGGCAGAAG ACCGCCGCCT TCGAGATCAT CGAGGAGCTC
GAGCGGGCGC CGGATTATCA CTGCCTGCCG GTGGGCAACG CCGGCAACAT CACCGCCCAC
TGGATCGGCT ATAGCGAGTG CGCCGGCCGC ACGGGCGACG AACAGCTGAC GGCGGCCTGC
GCCTTCTGCG GGGGGCAGTG CCGGTACGCC TCGGCGCTGG TGGAGCGGCG CCCGCGCATG
GTGGGCTACC AGGCCAGCGG CAGCGCGCCG TTCCTGCGGG GCGGCCCGGT GGCCGAGCCG
GAGACGGTGG CGACCGCCAT CCGCATCGGT GATCCGCAGT CGTGGGACTA CGCCCAGGCC
GTCCGCGAGG AGTCCGGGGG GTGGTTCGAT GAGCTGAGCG ACGAGGAGAT CCTCCAGGCC
CAGCGCATGC TCGCCGATCA CGAGGGGGTC TTCTGCGAGC CCGCATCGGC GACCTCGGTA
GCGGGGGCCA TGCGGGATAT CCGCAGCGGC CGCATCCCCG AAGGCAGTAC GGTGGTCTGC
ACCCTGACCG GCCACGGCCT CAAGGATCCG GATGTGGCGA GCGCCCAGGC CGGCGATGCG
GTTCAGACCG TGGATGCCGA CTACCAGGCG GTTCGCGAGG CCATCCTGAA GCGGCTTTGA
 
Protein sequence
MPFRPRYTGL IAKYLDRLPI SDDARILGLG EGNTPLIQLT RIPAELGRDV DLYVKFEGLN 
PTGSFKDRGM TMAVTKAVEQ GAKAIICAST GNTSASAAAY AARAGISCFV LIPDGKIAMG
KLAQAIMHGA QVLQIRGNFD AGMRLVKELA EHAPLTIVNS INPYRLQGQK TAAFEIIEEL
ERAPDYHCLP VGNAGNITAH WIGYSECAGR TGDEQLTAAC AFCGGQCRYA SALVERRPRM
VGYQASGSAP FLRGGPVAEP ETVATAIRIG DPQSWDYAQA VREESGGWFD ELSDEEILQA
QRMLADHEGV FCEPASATSV AGAMRDIRSG RIPEGSTVVC TLTGHGLKDP DVASAQAGDA
VQTVDADYQA VREAILKRL