Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2331 |
Symbol | |
ID | 7401948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 2328965 |
End bp | 2330536 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643709404 |
Product | HTTM domain protein |
Protein accession | YP_002566977 |
Protein GI | 222480740 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.524598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGAAC GTCGAGACGC TGCCGTCGAC GCCTCCGGAT CCGCGGCCGA TGGTGCCGGA TCCACGATCG ACGACGCCGA GCCCTCGGTC CGCGGTCGTC TCCCCGCCGC GCTCCGGCGC CGGATCGGGA TCGACGTGCG CGCGCTCGCG GCGTTCCGGA TCGCGCTCGG CGCCGTCCTC CTCGTTGACC TCGCGCTGCG CGCCCGGAAC TTGACGGCCT TTTACACCGA CGCCGGCGTA CTCCCCCGAT CGCTGCTCGC GGAGTCGTCC CCCCTCGCGC GATTCTCGCT GTACGCGGTC TCCGGTGAGG CGTGGTTCGT CGGGCTGCTG TTCCTCATCG CCGCCGTCGC CGCCGTCGCG CTCGCGGTCG GCTACCGGAC ACGGATCGCG GCTGCGGTCT CGCTGGTCCT GCTCGCGTCG CTGCAGGCGC GGAACCCGTT CGTGCTCAAC GCCGGCGACA CGCTCCTCTG GCAGCTGCTC GGGGCGGGCT TGCTGTGTCC CCTCGGCGCG CGCTGGTCGG TGGATGCTGT CCGGAGGCGC GCCGCGTTGG GAGGGCGATC CCTGCCCGAA AGCAGCCGAT TTACCGGCCC CCAATCAGCC CTCCTGTTGA CCGTCGTCGT CGCGGTCTAC GTCTCCAACG CAGTCGTGAA GCTCCGCGGC GAGGCGTGGC CCGCGGGCGA GGCGGTCGGG ACCGTCTTCC GCCTCACGTA CCTCCACGGC CCGCTCGGGG GACTGATGCC CGAGAGCCCG GCGCTGCTCG CGGCCGTCAC CTACGGCTGG CTCGCGCTAC TCGTCGCGTC GCCGTTACTC GTCGCGGCCG CCGGACGAGT CCGGGCTGCG CTCGCCGGCA TCCTCGTCGC TGCCCATCTC TCGATGGCGT TCACGCTCCA GATCGGCGTC TTTCCGGTGG TGTCGGCGAC CGCACTGCTA CCCTTCTGTC CGCCGTTCGT CTGGGACCGG ATCGAGTCGC TGGCCGCTCC GGAGATCGGG CGGTTCCGGT CGATGGCAGA GCGCCTCCTC CGTTCCCTCC GGTCGACGCG ACCCGGATCG ACCCTCGTCG ATCTGGCCTC CAAAATTGTT CCCGACAGAG CGACCCGCGA ACGTCTCGTC GCCGTCATCG CCGCGCTCCT GCTCGTCTCG CTGCTCGCGT GGACCGCCAT GGGGGTCGGA GTCGTCGACG CGCCAGAGCC CGTCGTGGCG GTGTCAGATC CGGCCGAGAG CGACTGGGAT ATGTTCGCGC CGGAGCCGCC GTCGACCGAC GCGCTCGTGC TCGCGACGGC GACGACCGCC GACGGCGACC GGACCGATGC GTTGCACGGC GACCCGGTCG CGACCGACCG CACCCCGTCC GACGCGCGGG GATATCCCAC CGCCCGCTGG CGGAAGCACT TCTCGCTGCT GTCGGCCGAC GATACCGATC GCATCGACGC GACGCTCGCG CACCTCTGTG ACCGCGCAGC GGGATTTTCC GGCGCGGAGA CGGAGGCGGT GACGGTCTCC GCTGTCGAAG TCGACGTCGT CGGGAGCGAG GAGATCTGGG TCCGAGAAGC CGGTACGCGT GAGTGCCGGT GA
|
Protein sequence | MDERRDAAVD ASGSAADGAG STIDDAEPSV RGRLPAALRR RIGIDVRALA AFRIALGAVL LVDLALRARN LTAFYTDAGV LPRSLLAESS PLARFSLYAV SGEAWFVGLL FLIAAVAAVA LAVGYRTRIA AAVSLVLLAS LQARNPFVLN AGDTLLWQLL GAGLLCPLGA RWSVDAVRRR AALGGRSLPE SSRFTGPQSA LLLTVVVAVY VSNAVVKLRG EAWPAGEAVG TVFRLTYLHG PLGGLMPESP ALLAAVTYGW LALLVASPLL VAAAGRVRAA LAGILVAAHL SMAFTLQIGV FPVVSATALL PFCPPFVWDR IESLAAPEIG RFRSMAERLL RSLRSTRPGS TLVDLASKIV PDRATRERLV AVIAALLLVS LLAWTAMGVG VVDAPEPVVA VSDPAESDWD MFAPEPPSTD ALVLATATTA DGDRTDALHG DPVATDRTPS DARGYPTARW RKHFSLLSAD DTDRIDATLA HLCDRAAGFS GAETEAVTVS AVEVDVVGSE EIWVREAGTR ECR
|
| |