Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2036 |
Symbol | |
ID | 7402055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2028381 |
End bp | 2030072 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643709107 |
Product | protein of unknown function DUF181 |
Protein accession | YP_002566684 |
Protein GI | 222480447 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0289934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.387863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATCG GACTTGTCGG TGACGGCCCC GCGGTCGAGG CCGTCTCGGC CGCGCTCGGC GACGTCGACG TGAACGTGAT GCCCGTCGAA GTGGGGCTGC TCGACGGGTT CGACCTCGCG GTCGTGGTCG ACACCGCCGG ATCGGAGACG TTCGCCGCCG CCAACGAGCA CCTGGACCGC TGGGTCGCAA TCGAGGTCGG CGGACTCGGC GGCGTCCCCC TGGAGCCTGT CGACGCCGCG GTGACCGCCT TCGACGAGAC CTGCTACGAC TGCCTCCGGC GACGCGTCGC CAGCGGCGGA GCCGAGCCCG CCGACGCGCC GGCCGGCCGC CGGTCGGCCG TGCGCTACGC CGGCGCGCTG GCCGGGCGCC GCGTCATCCA ACTGCTCGCC GGCGACCCGG TCGCCGACAC GGTCGTCGAG GTACCGGGTG CCGAGCGCAC CCTCCTCCCG GTTCCCGGCT GTGACTGCGG CGCCGACCCG GGCGACGCGC TGCCCCGCGA GCATCAGGAG CGATCGCTAT CGGACGCGAT CGACCGCGCG GAGCGGGCGG TCGATCCCCG AATTGGTCTG CTCTCGGAGG TGGGCGAACA GGAGTCGTTC CCCGTGCCGT ACTACGTCGC GCGGGTCGCC GACACGACGC CCTTCTCCGA CGCGCGGGCG GCCGAGTTCG GCGGCGGCGT CGACGCCGGG TGGGACGCCG CGTTCATGAA GGCGCTCGGC GAGGGGTTAG AGCGGTACGC CGCCGGAGTC TACCGCGGGG CGTCGTTCAC GCGCGCGCCG GCCGCGAACG TCCCGAATCC CGTGGCGCCC GACGCGTTCG TCCGTCCCGA GAGCGCGGAG CCGTACGCCC GCGACGACCG ACTTCCGTGG GTGACCGGCG AGCATCTCGG AACCGGGGAG GTCGCGAGTC TGCCGGCGGA GTTCGTCCAC TTCCCGCCGC CGGAGAACCG GTATCGGCCC GCGATCACGA CCGGGCTCGG CCTCGGAAAC TCCGGACCGC ACGCGGCGCT GTCGGGACTC TACGAGGCGA TCGAGCGCGA CGCGACGATG ACAAGTTGGT ACTCGACGGC CGACCCGCTC GGAATCGAGG TCGACGACGA GGGGTTCGCG GAACTGGAGA AACGCGCCCG CACCGAGTCG CTGTCGGTGA CCCCGCTCCT CGTCACGACC GACATCGACG TGCCCGTCGT CGCGGTCGGC GTGCATCGCG AGGGCGACTG GCCCCGCTTC GCGGCGGGGT CGGGCGCCGA CCTCGATCCC GCCGCCGCGG CCCGGAGCGC GCTCGCGGAG GCGCTCCAGA ACTGGACCGA ACTCCGGTCG ATGGGTCCCG ACGCGGCCGC AAAGCAGGGT GCGGCGATCG GTCGCCACGC CGACTTCCCG GCGGAGACGC GGGCGTTCTT CGACGCGGAC GCGACGGTCC CCGCCGAATC GCTCGGCGAG CCGGAGCTGT CGGGAGGCGA CGAGCTAGCC GCCGTCGTCG ACCGCGCCGA GGCGGTCGGA CTGGAGCCGT ACGTCGCGCG GACGACGACC CGAGACCTCG CCGCGCTCGG CTTCGAGGCG GTCCGCGTGC TCGTTCCCGG CGCGCAGCCG CTGTTCACCG GCGACCCGTT CTTCGGCGAC CGCGCCCGCG ACGTGCCCCG ATCGATGGGG TTCGAGCCCG ACTTGGAGAA GGCGTACCAC CCGTTCCCCT GA
|
Protein sequence | MDIGLVGDGP AVEAVSAALG DVDVNVMPVE VGLLDGFDLA VVVDTAGSET FAAANEHLDR WVAIEVGGLG GVPLEPVDAA VTAFDETCYD CLRRRVASGG AEPADAPAGR RSAVRYAGAL AGRRVIQLLA GDPVADTVVE VPGAERTLLP VPGCDCGADP GDALPREHQE RSLSDAIDRA ERAVDPRIGL LSEVGEQESF PVPYYVARVA DTTPFSDARA AEFGGGVDAG WDAAFMKALG EGLERYAAGV YRGASFTRAP AANVPNPVAP DAFVRPESAE PYARDDRLPW VTGEHLGTGE VASLPAEFVH FPPPENRYRP AITTGLGLGN SGPHAALSGL YEAIERDATM TSWYSTADPL GIEVDDEGFA ELEKRARTES LSVTPLLVTT DIDVPVVAVG VHREGDWPRF AAGSGADLDP AAAARSALAE ALQNWTELRS MGPDAAAKQG AAIGRHADFP AETRAFFDAD ATVPAESLGE PELSGGDELA AVVDRAEAVG LEPYVARTTT RDLAALGFEA VRVLVPGAQP LFTGDPFFGD RARDVPRSMG FEPDLEKAYH PFP
|
| |