Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0272 |
Symbol | |
ID | 7401198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 294258 |
End bp | 295703 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643707335 |
Product | domain of unknown function DUF1743 |
Protein accession | YP_002564947 |
Protein GI | 222478710 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.926897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATCG TCGCCGTCGA CGACACCGAC TCCCGCGAGC GCGGGATGTG CACGACATAC GTCGGCGCGC GGCTGGCCGA ACGGCTTGAC GCGGCCGGAG GTCGCGTCCG TCGCCGACTC CTCGTTCGGC TCAACCCCGC GGTGAAACAC AAGACCCGGG GCAACGCCGC GGTCGCGCTC CACGTCTCTG GCGTCGAGGC CGAGGCGGCG TTCGACCTCG CCGCGGAGAC CGTCCGGGAG TTCGCGGCGG CCGACGACCC CCGGACCTCG CCCGGCGTCG TCGTCGCCGA CATCGACGTG GCGGGCGACC CGTCTGACCC AAGCGCGAGC GCCCCGCCGT TCCCTCCGAG CGCCGCGAGC ACCGCACCGA TTCCCGCCGA CGTGGCGGAC TTCGCCCGAC GCGCGCTCCG GCACCGCCTC TCGCTCGACG AGGCGCTCAC ACTCGCCGAC GATCACGGCT TCCGCCACGC GGCGTTCGGA TCCGGCGGGG AGACGAATAC GGAGGCCGTC GCCGGGCGCG GCCGGATCGG CGCGCTCGCG GCGGTCGGCG CCCCCGCAGC GTTCGACGAC TGGACCGTCG AGCGCATCTC CTACCGCGAG CTCGATCGCT GCGGTACGCC CCGCGACGTC GATATCGAGA GCGTCTTCGC GGCCGCCGAC CGGGGATACC CGACCGTCTG GGACACCGTC GACCGCGGGA CGGGCGAGGC GGTCTGCGTC CCCAACGCCC CCGGACCGAT CTTACACGGG ATCCGCGGGG ACGACGCCAA CGCGTGTCGC GAGATCGCCG AGGAGATCGC CTCTGAGCCG GTCGAGCGCA CCGCGACGTT TCTGACGAAT CAGGGCACCG ACGCGCACCT CGCGCCGGGC GCGATCGGCG ACCTCCGCGA CGGCGCGGGG TATCGCGTCG CCGGCGTCGT CGCGAGCGAG CCGGAGACGA AACGCGGGGG ACACGTCCAC GTCGACGTGG CTGCGCCCGA CGATGATCGC GTTCCGCGCC TCCGGTGTGT CGCGTTCAAA CCGACCGGTC GGTTCCGCGA CCGCGTGCGC GCTCTCCGTC CCGGCGACCG GGTGACGGTG TGCGGCGAGC ACGAGGTCCG GCTGATCGGG GATTCTGGGG GCGCCGAAGA CGACGGAAAC GACGGCCCCG ATAGCGGTCA GTCGACCGCG ACGCTGAAAC TGGAGAAGTT CGCCGTGCGC GACCTCGTCG AGACCGAGCC CGCCGTGCCG ACCTGCCCCG ACTGCGGGCG GTCGATGTCC TCGGCGGGGC GGGGGCAGGG GTACCGCTGT CGCGACTGCG GAACGGACGC ACCCGGCAAG GTCGAAGAGT CGATCGATCG GGAGTTAGAA CCCGGTTGGT ACGAGGTCCC GCCGAGCGCC CGGCGACACG TCGCGAAGCC GCTCGTCCGC GGCGGGTTCG ACGGCCCGAT TCATCCGGAG CGGTGA
|
Protein sequence | MPIVAVDDTD SRERGMCTTY VGARLAERLD AAGGRVRRRL LVRLNPAVKH KTRGNAAVAL HVSGVEAEAA FDLAAETVRE FAAADDPRTS PGVVVADIDV AGDPSDPSAS APPFPPSAAS TAPIPADVAD FARRALRHRL SLDEALTLAD DHGFRHAAFG SGGETNTEAV AGRGRIGALA AVGAPAAFDD WTVERISYRE LDRCGTPRDV DIESVFAAAD RGYPTVWDTV DRGTGEAVCV PNAPGPILHG IRGDDANACR EIAEEIASEP VERTATFLTN QGTDAHLAPG AIGDLRDGAG YRVAGVVASE PETKRGGHVH VDVAAPDDDR VPRLRCVAFK PTGRFRDRVR ALRPGDRVTV CGEHEVRLIG DSGGAEDDGN DGPDSGQSTA TLKLEKFAVR DLVETEPAVP TCPDCGRSMS SAGRGQGYRC RDCGTDAPGK VEESIDRELE PGWYEVPPSA RRHVAKPLVR GGFDGPIHPE R
|
| |