Gene Hhal_1808 details


Gene Information

Locus tag: Hhal_1808
Symbol:
ID: 4710995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1981131
End bp: 1982153
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 66%
IMG OID: 639856278
Product: aspartate-semialdehyde dehydrogenase
Protein accession: YP_001003374
Protein GI: 121998587
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0136] Aspartate-semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms)
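
The coordinates and lengths listed above appear to be mutually consistent, and a short sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single 3-bp stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1981131
end_bp = 1982153
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-bp stop codon,
# so the remaining bases encode the protein, three per residue.
coding_codons = (span_bp - 3) // 3

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```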


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.996773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGC AGTACGATGT CGCCGTAGTC GGGGCCACCG GCGCGGTCGG AGAGGTGATG 
CTCTCCATTC TGGCCGAGCG CGGCTTCCCC GCGCGCAAGA TCTACCCGCT GGCCAGTGCC
CGCTCCGCCG GGCGGACGGT ATCCTTCGCC GGGCAGGAGC TGGAGATCCA GGACCTGGCC
CAGTTCGACT TCTCCCAGGT GCAGATCGCG CTCTTCTCCG CCGGGGGGTC CATCTCCGCC
GAGCACGCAC CGCGGGCGGC CGAGGCCGGG GCGGTGGTGA TCGACAACAC CTCCCACTTT
CGCTACGACG ACGACATCCC GTTGATCATT CCCGAGGTCA ACCCGCACGC GGTGGCTGGC
TACAAGAAGC GGGGGATCAT CGCCAATCCC AACTGTTCCA CCATCCAGAT GCTCGTGGCC
CTCAAGCCGA TCCACGACGC CGTGGGCATC GAGCGGATCA ACGTGGCCAC TTACCAGGCG
GTCTCCGGCA GTGGCAAGCC GGCCATCGAC GAGCTCAACG CCCAGAGCCG GGCGATCCTC
GACGGCGGTG AGCCGCAGTG TGCTGAGTAC CCGAAGCCCA TCGCGTTCAA TTGCCTGCCG
CACATCGACG ATTTCCAGGA CAACGGCTAC ACCAAGGAAG AGATGAAGAT GGTCTGGGAG
ACCATCAAGA TCTTCGAGGA CTCCTCCGTT CGGGTGAATC CCACCACGGT GCGTGTGCCG
GTGGTCTACG GCCACTCCGA GGCCGTGCAC ATCGAGACCC GCGAGCGCAT CACCGCCGAG
CGTGCCCGGC AGGTGCTCTC CAGCGCCCCC GGGGTCGAGG TCCTGGACGA GCGCACAGGC
GGCGGCTATC CGACGGCGCT GACGGAGGCC GCCGGACGCG ATCCGGTCTA CGTCGGGCGC
ATCCGCGAGG ACATCAGCCA CGAGCGGGGT CTCGATCTCT GGGTGGTGGC CGATAACGTC
CGCAAGGGGG CGGCGCTGAA CAGCGTGCAG ATTGCGGAGC TGCTGATTGG CGAACACATC
TGA
 
Protein sequence
MSKQYDVAVV GATGAVGEVM LSILAERGFP ARKIYPLASA RSAGRTVSFA GQELEIQDLA 
QFDFSQVQIA LFSAGGSISA EHAPRAAEAG AVVIDNTSHF RYDDDIPLII PEVNPHAVAG
YKKRGIIANP NCSTIQMLVA LKPIHDAVGI ERINVATYQA VSGSGKPAID ELNAQSRAIL
DGGEPQCAEY PKPIAFNCLP HIDDFQDNGY TKEEMKMVWE TIKIFEDSSV RVNPTTVRVP
VVYGHSEAVH IETRERITAE RARQVLSSAP GVEVLDERTG GGYPTALTEA AGRDPVYVGR
IREDISHERG LDLWVVADNV RKGAALNSVQ IAELLIGEHI
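
As a spot check that the protein sequence corresponds to the gene sequence, the sketch below translates the first 30 bases of the gene and prints the result for comparison with the first ten residues of the protein. The codon table is deliberately partial: it holds only the codons that occur in this prefix, all of which have the same meaning in translation table 11 as in the standard code. The function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons needed for this prefix.
CODONS = {
    "ATG": "M", "AGC": "S", "AAG": "K", "CAG": "Q", "TAC": "Y",
    "GAT": "D", "GTC": "V", "GCC": "A", "GTA": "V",
}

def translate_prefix(dna: str) -> str:
    """Translate a short DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence, as printed above.
first_blocks = "ATGAGCAAGC AGTACGATGT CGCCGTAGTC"

print(translate_prefix(first_blocks))  # MSKQYDVAVV
```

A full translation would need the complete table 11 codon set, which this sketch does not include.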