Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1950 |
Symbol | |
ID | 7399902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1950028 |
End bp | 1951068 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643709021 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002566598 |
Protein GI | 222480361 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.863146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.537489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCG GGCAATCGTT TCTCCTTCTC CTCATCGGAT CGGTCGCGCT CCTCACGCTT TTCGTCATCA GACCGTTTAT CGAGTACGTC ATCGCCTCCG CGATCCTCGC GTACGTTCTC TTCCCGTTCC ACGTGCGGCT CTCGCGGGGG CTTCAAGAGG GACTCTCCAA CCGATTCCGC GAGTCGCTTG CGCGTCAGTT GGGGTACATG CTGTCCGCGC TCTTCTTGAT CGTCTCGTCG ATCGTCGCCG TCATCCTGCC GCTCGCGTAC ATCTCTTGGG TGTTCGTCCG CGACCTCACG GAGATCGCCC GGGGGAACTC CGATATCGAC GTCGAGGCCA TCGAGACGGA GCTTGCGGCG CTCACAGGCG AACAGATTGA AGTCGGCGAG GTTCTCACAA CCGTCGGACA GGTGCTCGCC AACACGCTGT TCGGCGGACT GGGCGGGATC GTCACCACCG CGCTCCGCGC GTCGGTCGGA CTCTCGCTGG CGCTATTTCT GGTTTACTAC ACCCTGCTCG ACGGGCCGGC GTTCGTTCGG TGGCTCCGTC AGACGAGCCC GCTTCCGGCG GATGTCACGT CCGACCTCGT CGATCGGGTC GACGCGATGA CCCGCGGGGT CGTGATCGGT CACATCGTCG TGGCGCTGTT GCAGGCGCTG GTCGCGGGAC TCGGCCTCTG GGCGGCGGGG ATCCCGAACG TCGTCTTCTG GACGTTCGTG ATGGCGGTGT TGGCGCTGGT GCCGCTAGTC GGAGCGTTCT TCGTCTGGGG GCCCGCAGCG GCGTACCTCG TCGCGATCGA CCAGGTGACG GCCGGAGTGT TCCTCGCGAT ATACGGGGTC CTCGTCATCG CGATGGTGGA CAACTACGCG CGCCCGATCG TTATCGACCA GCAGGCGCAC CTGAATCCGG CGGTGATCCT CCTCGGGGTG TTCGGCGGGA TCTACTCCAT CGGATTCACC GGACTGTTCG TCGGTCCCAT CGTCATCGGG GTACTCGCGG CCACGCTGGA GACGTTCCGG GAGGATTACG ACCTTATCTA A
|
Protein sequence | MNRGQSFLLL LIGSVALLTL FVIRPFIEYV IASAILAYVL FPFHVRLSRG LQEGLSNRFR ESLARQLGYM LSALFLIVSS IVAVILPLAY ISWVFVRDLT EIARGNSDID VEAIETELAA LTGEQIEVGE VLTTVGQVLA NTLFGGLGGI VTTALRASVG LSLALFLVYY TLLDGPAFVR WLRQTSPLPA DVTSDLVDRV DAMTRGVVIG HIVVALLQAL VAGLGLWAAG IPNVVFWTFV MAVLALVPLV GAFFVWGPAA AYLVAIDQVT AGVFLAIYGV LVIAMVDNYA RPIVIDQQAH LNPAVILLGV FGGIYSIGFT GLFVGPIVIG VLAATLETFR EDYDLI
|
| |