Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0400 |
Symbol | |
ID | 7401017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 418413 |
End bp | 419930 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707464 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002565073 |
Protein GI | 222478836 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCCCG ATATCGCCGC TCTCCGTCGC TGTCTCGGTG GTTCAGAGCG TCCCGAACGA ACCGCAGTCG ACGGGGGATC CCGGTGGGGC TCCGCGGCCA GAACCGACGG CGGGGCCACA GCCGAACCGG CGTCCGAGAC GAACCGACCG TCAGCAGTTC GGTGGACCGG CCGCTGGCGG GGGATCGCCG CCGTCACGCT TCTCGCGGTG GCGATCGGCG TCCTCGCGAA ACGGCCTCCG TTGTTGCTGG TCGGCGCTCT CACCGGCGCG TACGTCGCCT ACCCGCGTCT CACCGCGACT CCCAACCCCG ATCTGTCCGT CCGACGGGAA ATAGATCCGG CGTCGCCGGC GGACGGGGAG TGGGTGTCCG TGCGGACGAC GATCACGAAC GAGGGAGACT CAGCGCTCGC CGACGTCCGA GTCGTCGACG GACCCCCGGC GATGCTCTCG GTCTCGGACG GATCGCCGCG ATGTGCGACG GCGCTGCTCC CGGGTGGTGA GGTGACGATA CGCTACGAGC TCCGGGCGCG CCCGGGTCGT CACGCGTTCC AGCCGACGAC CGTGCTCTGT CGCGACGCGA GCGGGTCGGT AGAGGTCGAA CTCTCACTCA CTGCGGCCAG TTCCTTCGAG TGCGAAGCGG AGATCCCGAC CGTCCCGCTA CGCGCCCAGA CCGGTCACCA TCCGGGCCCG CTGGTCACTG ACGACGGGGG CGAGGGGATC GAGTTCCACT CCGTCGAGGA GTACCGGCGC GGGGACCCCG CGAACCGGAT CGACTGGCGA CGATACGCGC GGACGGGCGA ACTCACCTCG ATGACGTTCC GCACCGAACG GCTCGCAGAG GTAGTGGTGT GCGTGGATGC ACGACCCGCG TCGTATCGAG CCGCGGACGC GACCGAACCC CACGCGGTCG CGCTCGCTGT CGACGCCGCC GGCCGAATCG GCGACGCGCT GTTCGACGCG AATCACCAGG TCGGCCTCGC CGGGTTCGGA CGGCACGTTT GCGTGCTTCC GACCCGGAAC GGTCCCGATC ACGCCAGCCG GTTCCACCGT CGGCTGGCGA CGGATCCGGC CTTCGGGCTG GATCCGCCGG AAACGGCACG CACGACCGAT CGTGAGGGCG GAACAGCTCT GCCCGGATCC GTCGCTGGGG ACGGACCGGA CGGCGGTTGG GTGACGAGAG GTGATGGGGC CGACGGGCCC GACGGGGCCG TCCCGGTCGA CACCCAGCTG TCCAGAATTC GGGCCGAGAT CGGGGCGAAC ACACAGGTCG TGTTGATCAC GCCGCTGTGT GACGACGAGG CCTCCCGCAT CGCCCAGCGG TTCGAGAGCG GTGGGACCGC CGTCAGGGTT GTCAGTCCGG ACGTCACCAC GACGGAGACC GCGGGGGGAC GGCTCGCCCG GCTGGAGCGA ACGCATCGGC TGAGCATGCT ACGGAACGCC GGTATTCCCG TCGTCGACTG GACACCCACC CAACCGCTGG GCGCGGCGAT GGCGGCCGTC GAGGGGTGGG GTCGATGA
|
Protein sequence | MTPDIAALRR CLGGSERPER TAVDGGSRWG SAARTDGGAT AEPASETNRP SAVRWTGRWR GIAAVTLLAV AIGVLAKRPP LLLVGALTGA YVAYPRLTAT PNPDLSVRRE IDPASPADGE WVSVRTTITN EGDSALADVR VVDGPPAMLS VSDGSPRCAT ALLPGGEVTI RYELRARPGR HAFQPTTVLC RDASGSVEVE LSLTAASSFE CEAEIPTVPL RAQTGHHPGP LVTDDGGEGI EFHSVEEYRR GDPANRIDWR RYARTGELTS MTFRTERLAE VVVCVDARPA SYRAADATEP HAVALAVDAA GRIGDALFDA NHQVGLAGFG RHVCVLPTRN GPDHASRFHR RLATDPAFGL DPPETARTTD REGGTALPGS VAGDGPDGGW VTRGDGADGP DGAVPVDTQL SRIRAEIGAN TQVVLITPLC DDEASRIAQR FESGGTAVRV VSPDVTTTET AGGRLARLER THRLSMLRNA GIPVVDWTPT QPLGAAMAAV EGWGR
|
| |