Gene Hhal_1652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1652 
Symbol 
ID4709614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1804178 
End bp1805428 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID639856119 
Productaspartate kinase 
Protein accessionYP_001003218 
Protein GI121998431 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCTGG TAGTCAAAAA ATTCGGTGGG ACATCGGTCG GGACGACCGA GCGCATCCAG 
GCAGTGGCCG AGCAGGTCAA GGCAAGTCGT GAGGCCGGAG ACGATGTGGT CGTGGTCGTC
TCCGCTATGA GCGGCGAGAC CAATCGCCTG GTTGAGCTTG CCGAGTCGAT TCACCCAAAT
CCGCCAGCAC GCGAGGTGGA TGTCCTGTTG TCCACCGGTG AGCAGGTCAC CATCGCGCTG
CTTACGATGG CGCTGGAGCA GATCGATTGC CCCGCGCGCT GCTACACCGG TGCGCAGGTG
CGGATTCTGA CCGATAGTTC ATTCAGCAGG GCGCGAATCC TCGATATCGA TGCCGAGCCG
TTGCAAGAGG ATCTGCAGCG GGGACGTGTC GTGGTGGTTG CGGGCTTTCA GGGTGTCGAC
GAGGAAGGCG CGCTCACCAC GCTGGGCCGT GGTGGTTCCG ATACTACGGC CGTGGCGCTG
GCAGCCGCAC TGGAAGCGAG CGAGTGCCAG ATTTATACCG ACGTGGATGG GGTCTACACC
ACCGACCCGC GCGTCGTACC CGAGGCGAGG CGGTTGGACC GTATAACGGT CGACGAGATG
CTGGAACTCT CCAGTCTGGG CTCCAAGGTC CTGCAGATCC GGGCGGTAGA GTTTGCCGGC
AAGTACCGCG TGCCCCTGCG GGTGCTGTCT AGCTTCGAAG AAGGGCCTGG AACGCTCATC
ACCTACGAGG AAGAAGGAAT GGAAGAGCCG CTGATTTCCG GTATTGCGTT CAACCGCGAC
GAGGCGAAGA TCTCCGTCAT CGGTGTGCCG GACACGCCAG GGATCGCGGC CAAGATCCTC
GGGGCGGTTG CCGAGGCGAA CATCGAAGTG GACATGATCA TCCAGAACAT CAGCCAGCAG
GGGCTGACGG ATTTCACCTT TACGGTCCAC AAGCGCGATT ATCAGGCCAC CCTGGATCTG
GTCCGGGACA ACGCTGAGCA ACTCGGAGCC CGGGAAGTCT ATGGCGACGA CAAGATTGTC
AAGCTCTCCC TGGTCGGGGT CGGGATGCGT TCCCACGCCG GCGTGGCGAA CACCATGTTC
CGTGCGTTGT CCAACGAGGG GATCAACATT CAAATGATCT CGACTTCGGA GATTAAGATC
TCGGTGGTTG TCGAAGAGAA ATACTTGGAG CTTGGCGTGC GCGCCTTGCA CAAGGCCTTC
GAGTTGGATG TACAGCCCGT GGCCGAGCTT GAGGGTCTCT CCGACGAGTA G
 
Protein sequence
MGLVVKKFGG TSVGTTERIQ AVAEQVKASR EAGDDVVVVV SAMSGETNRL VELAESIHPN 
PPAREVDVLL STGEQVTIAL LTMALEQIDC PARCYTGAQV RILTDSSFSR ARILDIDAEP
LQEDLQRGRV VVVAGFQGVD EEGALTTLGR GGSDTTAVAL AAALEASECQ IYTDVDGVYT
TDPRVVPEAR RLDRITVDEM LELSSLGSKV LQIRAVEFAG KYRVPLRVLS SFEEGPGTLI
TYEEEGMEEP LISGIAFNRD EAKISVIGVP DTPGIAAKIL GAVAEANIEV DMIIQNISQQ
GLTDFTFTVH KRDYQATLDL VRDNAEQLGA REVYGDDKIV KLSLVGVGMR SHAGVANTMF
RALSNEGINI QMISTSEIKI SVVVEEKYLE LGVRALHKAF ELDVQPVAEL EGLSDE