Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2402 |
Symbol | |
ID | 7400520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2392389 |
End bp | 2393789 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643709475 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_002567047 |
Protein GI | 222480810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACG CGCGCAGTTC ATGGTGTATG AGCAACGGCG ACGACGACCC GGCCGACGCC TCCGGAGAGG CCGACGCCGC GGACGACGCG GAGGAGGCGG GCGGCGCCGG CGAGTCGACG GATGCGGCCG CCGCGCCCAC GCTTCCCGAC GAAGCGACCG AGGAGTCCTT GAACGAGTAC CTCGACGAGA TCGCGGAGCG CCTCGACGAC GCCGAGACCG AAGCGGACCT CGACGATATC GACGCGCTCC TCGACGACGC CGAAGACGGG CTCGACGAGG CCGACCTCCC CGAGCCGGAC GAGGACGACG AGGACGCCGA CGACCCGCGC GGCGACCTCG AAGACCGGAT CGCGGAGCTC CGTGACGGCG TCGAGGAGGC CCGTGGCCCC TACGCCGAGG ACGTGATCGA TGCGATCGAG GCCGCCGCGT CGACGCTCGA AGACACCGAA TGGACCGCAG ACGGGCGCGA CGACGCCGCC GACGCTGTCG ACGCCTTCGT CGAGGAGGCG TCCGAGTCGG TCGCGGTCGA CGCGTTCCCC GAACACGACG ACCTCGACGA GCTGATCGCC GCGCTCGACG CGGTCGCCGA AGCGGTCGCT GACGCGGATC TCGATCCCGA CGCCGACGCC GAGACGATCG CAGCCCTGAT CGACGCGACG GACGGGCTCG AAGCGGGCCT CGACGACGCC GAGGAGTGGG ACGACCTCGA GACTCACGAG CAGCTCCGCG CGCAGGGGTA CTACGACGTG CTCGGCCACT ACAAGGACTT CCCGGTCGAG TGGTCCGCGC TGAAGGAACA CGAGGCGCGC GGCAACGTCG ACATGATCCT GCTCGCGCTC AACTCGCTGG AATCCGACTT CATGGAGCGC CACTGCCTCG AAGCGTTCGA ACGCATGGGC AAACGCGGAA AGACCGAGGC CTCCCTGGAG GAGCTTCTCG GCCGCGCGGA GAAGCGTGAT CAGTTCGCGA TCCGTATCCT CGGCAAGATG GCGGCCGAAG AGGCGACCGA GACGCTCGTC GAGTTCGTCT CGGAGGACTC GAACCCGCAG CTCCAGAAGA TCGTGTTCAA GGCTCTCGGC GAGATCGGCG CCGCGGAGGC GGTCCAGCCG CTCGCGGACC AGCTCGATCC CGAGGGGGAC ACCGAGGACC TCGTGCGCCC CCACGCCGCC CGTGCGCTCG GACTCATTGG CGACACCCGC GCGATCGACC CCCTCGCCGA CGCGCTCGCC GAGGACGATT CCGACGACGT CCGCGCCGCC GCCGGCTGGG CGCTCCGCCA GATCGGCACG CGCGAAGCGA TCGAAACGGT CGCCGAGTAC GCGGACGAGC ACTCCTTCAT CGTCTCCACC GAGGCCGAGA AGGCCGAGCA GTCCCTCGAC GCGGCGTCGG CGACCGCCTG A
|
Protein sequence | MDYARSSWCM SNGDDDPADA SGEADAADDA EEAGGAGEST DAAAAPTLPD EATEESLNEY LDEIAERLDD AETEADLDDI DALLDDAEDG LDEADLPEPD EDDEDADDPR GDLEDRIAEL RDGVEEARGP YAEDVIDAIE AAASTLEDTE WTADGRDDAA DAVDAFVEEA SESVAVDAFP EHDDLDELIA ALDAVAEAVA DADLDPDADA ETIAALIDAT DGLEAGLDDA EEWDDLETHE QLRAQGYYDV LGHYKDFPVE WSALKEHEAR GNVDMILLAL NSLESDFMER HCLEAFERMG KRGKTEASLE ELLGRAEKRD QFAIRILGKM AAEEATETLV EFVSEDSNPQ LQKIVFKALG EIGAAEAVQP LADQLDPEGD TEDLVRPHAA RALGLIGDTR AIDPLADALA EDDSDDVRAA AGWALRQIGT REAIETVAEY ADEHSFIVST EAEKAEQSLD AASATA
|
| |