Gene Hlac_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1914 
Symbol 
ID7399866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1917105 
End bp1918481 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content68% 
IMG OID643708985 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002566562 
Protein GI222480325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.42815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG ACCAAGCGAT CGCCGACCCG GAGGAAACGG GCCCGCTAGA GACGTTCCGT 
CAGTTCTTCG CCTTAGAGCG TGACGTGCTG GTGGTCTCGC TGTCGATGTT CGCGTTCAGC
CTCGGGTTCC AGATGACGAG CCGGTTCCTC CCGGAGTACA TGGTGGCGCT CGGGGCGTCC
GGGTTCGTCG TCGGGTTGTT CGGGACCTTC GGCAACGTCA TCTCCGCGGT GTACCCGTAT
CCGGGCGGCG CGATATCCGA CCGCATCGGG TCGCGGTACG CGCTGACGGC CTTCGGACTC
GTTTCGACCG TCGGCTTCGT CGTCTGGCTG ATCGCCCCGA ACGTCGGGGC GGTCACGGTC
GCGGGCGTGA CGATCGAACC GTGGATCTGG ATCTTCGTCG GGCTCGTGCT CGCGCAGGCG
TGGAAGTCGT TCGGCCTCGG CGCGACCTTC GCCGTGGTCA AGCAGGCGAC GGACCCGTCC
CGGCTGGCGG CCGGGTTCGC GAGCACGGAG ACGTTCCGAC GCACCGCGCT CCTGATCGGT
CCCGTCCTCG CGGCGATTCT CATCGACCTC CATCCGGCGT TCACCGTGAG CTTTCGGTAC
GTGCTCGCGG TGGCGGTCGT CTTCGGCGTC GTCGGGACGC TCGTGCAACA CGTCCTGTAC
GACGCGAGCG GGGACGCCGT CGGTGGCGGG CGGTTCGAGG GCGTCTCCCG GATCCGGACG
GACCTCCGCG AGATGCCCGA CCCGCTCCGG CCGCTGCTGA TCGGCGACAC TCTCGTCCGC
TTCGCGAACG GGATGGTGTA CGTCTTCTTC GTGCTCGTCG TCACGCGGAT CTTCTCGGTG
GGACTAGAGA GGACGGTCGC GGTCGGCGGG ATTTCCTACG CCGTGGACCT CTCGCCGCAG
GCCTTTTTCG GCTACCTGCT GGGCGTCGAG ATGGTCGTCG CGCTGATCAC GATGGTCCCC
GCGGCGAAGC TGGCCGAGCG GGTCGGGCTC AAGCCGATCG TCGCGCTCGG CTTTTTCGTG
TACGGCGTGT TCCCTCTCGT CCTCGTCTTC GGTCCCGACC TCCTCTCGCC GTTCGTCTCG
ATCCAGTGGG CGCTGGTCCT CGTCTTCGCG TTCTCCGGGC TCCGGTTCGC CGGCCTCCCT
TCGCATAAGG CGCTCATCGT CGGTCCCGCC GAACAGGGCG CCGGCGGCCG GGTCACCGGG
ACCTACTACC TGCTGCGAAA CACGATCGTC ATCCCGAGCG CCGCGATCGG CGGCTACCTC
TGGGACTTCG TCAGTCCGGA GGTCGCCTTC ACCGTCGCCG CCGTGATCGG CGTCGCCGGG
ACCGGCTACT TCCTCGTCTT CGGCGAGGAG TTCGAGGCGT ACGCCCGGGG TCGGTGA
 
Protein sequence
MSKDQAIADP EETGPLETFR QFFALERDVL VVSLSMFAFS LGFQMTSRFL PEYMVALGAS 
GFVVGLFGTF GNVISAVYPY PGGAISDRIG SRYALTAFGL VSTVGFVVWL IAPNVGAVTV
AGVTIEPWIW IFVGLVLAQA WKSFGLGATF AVVKQATDPS RLAAGFASTE TFRRTALLIG
PVLAAILIDL HPAFTVSFRY VLAVAVVFGV VGTLVQHVLY DASGDAVGGG RFEGVSRIRT
DLREMPDPLR PLLIGDTLVR FANGMVYVFF VLVVTRIFSV GLERTVAVGG ISYAVDLSPQ
AFFGYLLGVE MVVALITMVP AAKLAERVGL KPIVALGFFV YGVFPLVLVF GPDLLSPFVS
IQWALVLVFA FSGLRFAGLP SHKALIVGPA EQGAGGRVTG TYYLLRNTIV IPSAAIGGYL
WDFVSPEVAF TVAAVIGVAG TGYFLVFGEE FEAYARGR