Gene Hhal_0125 details

Gene Information

Locus tag: Hhal_0125
Symbol:
ID: 4710622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 143617
End bp: 144927
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 66%
IMG OID: 639854583
Product: homoserine dehydrogenase
Protein accession: YP_001001721
Protein GI: 121996934
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
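
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(143617, 144927)
print(length, protein_length_aa(length))  # prints: 1311 436
```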


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAACCGG TTAATGTGGG GATGCTCGGC CTCGGCACCG TGGGTTCGGG CGTGGTCAAT 
ATCCTCGAGC GCAACGCCGA TGTGATCAAC CGGCGCGCCG GGCGTGAGAT CCGCGTTACC
CACGCCTCGG CGCGCCACCC GGAGCGGCCG CGCAGCTGTC GGCTGGAGGG GATTCGCCTG
ACCACCGATC CCTTCGAGGT GGTCGATGAC CCCGAGGTGG AGATCATCGC CGAGCTTATC
GGCGGCCACG AGCCGGCTCG GGAGCTGGTG CTGCGGGCCC TGGATAACGG CAAGCACGTC
ATCACGGCCA ACAAGGCGCT GATCGCCTCC CACGGGAACG AGATCTTCGC CCGCGCACGT
GAGAATGGTG TCACCGTCGC CTTCGAGGCG GCCGTTGCCG GCGGCATCCC GATCATCAAG
GCGGTCCGCG AGGCCTTAAC CGGTAATCGC ATCGAGTGGC TGGCGGGGAT CATCAACGGC
ACCTCGAACT ACATCCTCAC CGAGATGTTC TACGAGGGGC GGCAGTTCGG TGATGTGCTC
GCCGAGGCGC AACGCCTGGG CTACGCGGAG GCGGATCCGA GCTTCGATGT GGACGGCACC
GACGCCGCCC ATAAGCTCAC CATCCTGGCC TCCATCGCCT TCGGGATCCC GCTGCAGTAC
GACAAGGTCT ACGTCGAGGG GATTGACCAC ATCACCCGCG AGGATGTCGC CTTCGCCGAA
GAGCTGGGCT TCCGCATCAA GCACCTGGGG ATGGCCTTCC ACGAGGAAGG GGGGTACGCA
CTGCGGGTCC ACCCGACCCT GCTGCCGCGG CGGCACATGC TGGCCAACGT GGACGGGGTG
ATGAACGCCG TCATGGTCAA GGGGGACGCG GTCGGCCCGA CGCTCTACTA CGGCGCCGGG
GCCGGCGCCG AGCCGACCGC CTCGGCGGTG GTGGCCGACA TGATCGATGT GGTCCGCGAG
TTCAACCTCG AGCCGGAAAA CCGGGTGCCG TATCTGGCCT TCCACACGGA GTCCCTCTCT
CGGGAGCCGG TGCTGCCCAT GCGCGATGTC GAGACCGCCT ACTACCTGCG GCTCTCGGCC
AGGGACGAGC CCGGTGTGCT GGCGGATGTC ACTCGTGTGT TGGGGGATTT CGGGATCTCT
ATCGAGGCCA TTATCCAGAA ACAGCCGCAG GCGGGGGCCG AACACGTCCC GATCATCCTG
CTGACCCACC GAATCCACGA GCGGCATATG GATGCGGCCA TCGAGCGCCT GGAGCACCTG
GAGCAGGTTG ATGGCCGTGT CGTGCGCATC CGCGTCGAGA GCCTGGAGTG A
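
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any statistic such as GC content can be recomputed. The following is a minimal sketch, assuming the block contains only A, C, G, and T plus whitespace; `clean_sequence` and `gc_fraction` are hypothetical helper names, and `FIRST_ROW` is just the first row of the block above.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence block, copied as displayed.
FIRST_ROW = "TTGAAACCGG TTAATGTGGG GATGCTCGGC CTCGGCACCG TGGGTTCGGG CGTGGTCAAT"

seq = clean_sequence(FIRST_ROW)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Running the same two functions over the full 1311 bp block gives a value that can be compared with the GC content field in the gene information.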
 
Protein sequence
MKPVNVGMLG LGTVGSGVVN ILERNADVIN RRAGREIRVT HASARHPERP RSCRLEGIRL 
TTDPFEVVDD PEVEIIAELI GGHEPARELV LRALDNGKHV ITANKALIAS HGNEIFARAR
ENGVTVAFEA AVAGGIPIIK AVREALTGNR IEWLAGIING TSNYILTEMF YEGRQFGDVL
AEAQRLGYAE ADPSFDVDGT DAAHKLTILA SIAFGIPLQY DKVYVEGIDH ITREDVAFAE
ELGFRIKHLG MAFHEEGGYA LRVHPTLLPR RHMLANVDGV MNAVMVKGDA VGPTLYYGAG
AGAEPTASAV VADMIDVVRE FNLEPENRVP YLAFHTESLS REPVLPMRDV ETAYYLRLSA
RDEPGVLADV TRVLGDFGIS IEAIIQKQPQ AGAEHVPIIL LTHRIHERHM DAAIERLEHL
EQVDGRVVRI RVESLE
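
The protein sequence can be reproduced from the gene sequence with translation table 11. The gene starts with TTG, which table 11 lists as an alternative initiation codon, so it is read as M in the first position. The following is a minimal sketch, assuming the input is a complete forward-strand CDS containing only A, C, G, and T; `translate_cds` is a hypothetical helper, and the codon table is built from the compact NCBI-style amino acid string for table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (table 11 uses the standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(block: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on non-ACGT symbols
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("TTGAAACCGG TTAATGTGGG GATG"))  # prints: MKPVNVGM
```

The printed fragment matches the first eight residues of the protein sequence above.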