Gene Information |
Locus tag | Hlac_2723 |
Symbol | |
ID | 7401334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2713362 |
End bp | 2714678 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643709798 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002567364 |
Protein GI | 222481127 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
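The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2713362
end_bp = 2714678


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the stop codon is included in the gene length but not
    in the protein length.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1317 438
```

Both results agree with the Gene Length (1317 bp) and Protein Length (438 aa) rows.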
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTCA CGATCGCGCT CGTCGGCGTG GCCGCAATCC TCGTGTTGAC GGGGATCAGC GCCTTCTTCT CCAGTTCGGA ACTCGCCGTG TTCTCGGTCC CGAGCCATCG CGTCGACTCG CTGCTCGCCG CCGACGTACC CGGCGCGCGG GCGCTATCGG CGCTGCGCGC AGACTCGCAT CGCTTCCTCG TTACCGCCCT GGTGAGCAAC AACGTCGCCA ACATCGCGGC CGCGTCGGTC GCGACTGCGG TGTTCGTCCG GTTCGGCTTC TCCGGTGGCG AGGCGGCCAC GGGCTCGACG CTGGTCACCT CCGTGTTCGT CATCGTGTTC GGCGAGATCG CGCCGAAATC GTACGCGGTG GCGAACGCGG AGAAGCACGC CCTCCGCGTG TCGCGACCGG TGGTCGCGAT CCAGCGCCTT ATCCGACCCG TGTTATACAT CTTCGAGGCG CTCTCCGGCG TCGTCAATCG CTTCACCGGC GGCGAATCCG ACATCGAATC GTACCTCACG CGCGAGGAGA TCGAGACGCT CGTGCTCTCC GGCGAGGCGG CGGGCGCGCT CGACCCGGAC GAGGGCGCGA TGATCCGCGG CGTCCTCGAT CTGGAGTCGA CCCGCGTGTC GGCGGTGATG GTCTCTCGGA CCGACATGGT CGCGCTGCCG GACACCGCCA CGCCCGCCGA GGCGGTCTCG ACCGCCGCCG CGGAGGGCGT CACGCGGATG CCGGTGTACA GCCAGAACCG CGACGACGTG GTCGGCGTCG TCGACCTGCG CGACGCGATC GGCGCCAACG AGCGCGGAGA GCCCCTCGCA AGCGCGCTCC ACGAGCCGAC CTTCGTCCCG GAGACGCAGC CGGTCGACGA GCTGTTCGCG ACAATGCGGT CGAGCGCTCT CCGGATGGCG ATCGTCGTCG ACGAGTTCGG CGCGGTGGTA GGGATCGTGA CCTTGGAGGA TGTACTCGAG GAGATCGTCG GCGAACTGGT CGGGGGATGG GAGACCGACC ACGTCGACGT GGTCGCGCCC GACGCCGCGG TGGCCCGCGG GTGGACCACC GTCGCGCACC TCAACGAGAC GCTCGGGCTG GACCTCCCGA TCGACGGGGG CACCGAGACC GTCGCGGGAC TGGTGACCCG GCAGCTCGGG CGGGTCCCCG CTGAGGGCGA CCGCGTCGAG ATCGGCGACG TGACGCTCGC GGTCACGGGC GCGACCGCGA CGCGAGTGAC CCGAGTGCGG GTGGAGCATC CGGGGATCGG AACCGAGGGC GAAAGCGGAT CGGACATCTC CGGGTCCCCG GACGCGACGG GCGACGACAC CGAGTGA
|
Protein sequence | MDLTIALVGV AAILVLTGIS AFFSSSELAV FSVPSHRVDS LLAADVPGAR ALSALRADSH RFLVTALVSN NVANIAAASV ATAVFVRFGF SGGEAATGST LVTSVFVIVF GEIAPKSYAV ANAEKHALRV SRPVVAIQRL IRPVLYIFEA LSGVVNRFTG GESDIESYLT REEIETLVLS GEAAGALDPD EGAMIRGVLD LESTRVSAVM VSRTDMVALP DTATPAEAVS TAAAEGVTRM PVYSQNRDDV VGVVDLRDAI GANERGEPLA SALHEPTFVP ETQPVDELFA TMRSSALRMA IVVDEFGAVV GIVTLEDVLE EIVGELVGGW ETDHVDVVAP DAAVARGWTT VAHLNETLGL DLPIDGGTET VAGLVTRQLG RVPAEGDRVE IGDVTLAVTG ATATRVTRVR VEHPGIGTEG ESGSDISGSP DATGDDTE
|
| |
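The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the blocks might be joined, the GC fraction computed, and the nucleotide sequence translated. It uses the standard amino acid assignments, which translation table 11 shares with the standard genetic code (table 11 differs only in its permitted start codons). All function names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, dropping a terminal stop."""
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    protein = "".join(CODON_TABLE[codon] for codon in codons)
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence shown above.
fragment = clean("ATGGATCTCA CGATCGCGCT CGTCGGCGTG")
print(translate(fragment))  # MDLTIALVGV
```

The translated fragment matches the first block of the protein sequence. Pasting the complete gene sequence into `clean` would allow the same functions to be compared against the reported GC content and full protein sequence.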