Gene Hlac_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1950 
Symbol 
ID7399902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1950028 
End bp1951068 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content64% 
IMG OID643709021 
Productprotein of unknown function UPF0118 
Protein accessionYP_002566598 
Protein GI222480361 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.863146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.537489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCG GGCAATCGTT TCTCCTTCTC CTCATCGGAT CGGTCGCGCT CCTCACGCTT 
TTCGTCATCA GACCGTTTAT CGAGTACGTC ATCGCCTCCG CGATCCTCGC GTACGTTCTC
TTCCCGTTCC ACGTGCGGCT CTCGCGGGGG CTTCAAGAGG GACTCTCCAA CCGATTCCGC
GAGTCGCTTG CGCGTCAGTT GGGGTACATG CTGTCCGCGC TCTTCTTGAT CGTCTCGTCG
ATCGTCGCCG TCATCCTGCC GCTCGCGTAC ATCTCTTGGG TGTTCGTCCG CGACCTCACG
GAGATCGCCC GGGGGAACTC CGATATCGAC GTCGAGGCCA TCGAGACGGA GCTTGCGGCG
CTCACAGGCG AACAGATTGA AGTCGGCGAG GTTCTCACAA CCGTCGGACA GGTGCTCGCC
AACACGCTGT TCGGCGGACT GGGCGGGATC GTCACCACCG CGCTCCGCGC GTCGGTCGGA
CTCTCGCTGG CGCTATTTCT GGTTTACTAC ACCCTGCTCG ACGGGCCGGC GTTCGTTCGG
TGGCTCCGTC AGACGAGCCC GCTTCCGGCG GATGTCACGT CCGACCTCGT CGATCGGGTC
GACGCGATGA CCCGCGGGGT CGTGATCGGT CACATCGTCG TGGCGCTGTT GCAGGCGCTG
GTCGCGGGAC TCGGCCTCTG GGCGGCGGGG ATCCCGAACG TCGTCTTCTG GACGTTCGTG
ATGGCGGTGT TGGCGCTGGT GCCGCTAGTC GGAGCGTTCT TCGTCTGGGG GCCCGCAGCG
GCGTACCTCG TCGCGATCGA CCAGGTGACG GCCGGAGTGT TCCTCGCGAT ATACGGGGTC
CTCGTCATCG CGATGGTGGA CAACTACGCG CGCCCGATCG TTATCGACCA GCAGGCGCAC
CTGAATCCGG CGGTGATCCT CCTCGGGGTG TTCGGCGGGA TCTACTCCAT CGGATTCACC
GGACTGTTCG TCGGTCCCAT CGTCATCGGG GTACTCGCGG CCACGCTGGA GACGTTCCGG
GAGGATTACG ACCTTATCTA A
 
Protein sequence
MNRGQSFLLL LIGSVALLTL FVIRPFIEYV IASAILAYVL FPFHVRLSRG LQEGLSNRFR 
ESLARQLGYM LSALFLIVSS IVAVILPLAY ISWVFVRDLT EIARGNSDID VEAIETELAA
LTGEQIEVGE VLTTVGQVLA NTLFGGLGGI VTTALRASVG LSLALFLVYY TLLDGPAFVR
WLRQTSPLPA DVTSDLVDRV DAMTRGVVIG HIVVALLQAL VAGLGLWAAG IPNVVFWTFV
MAVLALVPLV GAFFVWGPAA AYLVAIDQVT AGVFLAIYGV LVIAMVDNYA RPIVIDQQAH
LNPAVILLGV FGGIYSIGFT GLFVGPIVIG VLAATLETFR EDYDLI