Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0667 |
Symbol | |
ID | 7401802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 683674 |
End bp | 684672 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707733 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_002565339 |
Protein GI | 222479102 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.448308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00548285 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGGACA CCACCACCAC GACACCCTAC GCGACGCAGG ACGAGACGGA CCGAGAGACG ACCGACGAGC CCTTCGAGGG CGTCTACCCG GCGATGACGA CCCCGTTCAC GGACGACGAC GAGGTCGATC ACGAGCAGCT CGCCGCCAAC GCCCGCTACC TCGAACGCGC CGGCGTCGAC GGCGTGGTCC CCGTCGGCTC CACCGGCGAG TCGGCGACAA TGAGCCACGA CGAGCACGTC GACGTGATCG AGACGGTCCG CGACGCCCTC GAAGACGTGC CGGTGGTCGC CGGAACCGGG TCGAACAACA CCGCCGAGGC GCTCTCGCTG TCCGAGCGCG CCGCCGATGC GGGGGCCGAC GGCCTCCTCC TCATCTCCCC GTACTACAAC AAGCCCGAAC CGCAGGGCTT TCTGGAGCAC TACCGGACGA TCGCCGACGA AGTCGATCTC CCGCAGATCG TCTACAACGT CCCGAGCCGG ACGGGCCAAT CGATCCCCAT CGACGTGACC GTCGAGCTCG CCGAACACCC GAACATCCAG GGGTACAAGG CTGCCTCCGG CGACCTCAAC CTCATCAGCG AGGTGATAGA GCGCACCCGC GACGAGGAGT TCTCCGTGCT CTCCGGCGAC GACGGGCTCA CCCTGCCCGT GCTCTCGATC GGCGGCACCG GCACCATCAG CGTCGTCGGC AACGTCGAGC CGGAACGCTC CTGCGCGATG GTCGGCGCCG CGCTCTCGGG CGATTACGAC CGCGCGCGGG CGCTCCACCA CGAGCTCTCG CCGCTCGTCC GCGAACTGTT CGCCGAGACG AATCCGATCC CGGTGAAGGA GGCGATGCAC ATCCGCGGGC GGGGCGGCCC GAGCGTGCGC TCGCCGCTCA GTCGGCTCTC CGAGGACCGG CGCGAGATCC TCCAAGAGCT GCTGGCCGAG TACGACGGGG GCGCGCCCGG AGCCGTCGAT CCGGAGGCTG TCGAGCCCAC GGCGGGGGAC GCCGAATGA
|
Protein sequence | MTDTTTTTPY ATQDETDRET TDEPFEGVYP AMTTPFTDDD EVDHEQLAAN ARYLERAGVD GVVPVGSTGE SATMSHDEHV DVIETVRDAL EDVPVVAGTG SNNTAEALSL SERAADAGAD GLLLISPYYN KPEPQGFLEH YRTIADEVDL PQIVYNVPSR TGQSIPIDVT VELAEHPNIQ GYKAASGDLN LISEVIERTR DEEFSVLSGD DGLTLPVLSI GGTGTISVVG NVEPERSCAM VGAALSGDYD RARALHHELS PLVRELFAET NPIPVKEAMH IRGRGGPSVR SPLSRLSEDR REILQELLAE YDGGAPGAVD PEAVEPTAGD AE
|
| |