Gene Hlac_0243 details


Gene Information

Locus tag: Hlac_0243
Symbol:
ID: 7401169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 260893
End bp: 263604
Gene Length: 2712 bp
Protein Length: 903 aa
Translation table: 11
GC content: 67%
IMG OID: 643707306
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_002564918
Protein GI: 222478681
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
            [TIGR02323] phosphonate C-P lyase system protein PhnK
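
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions, although the numbers in this record are consistent with them. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 260893
END_BP = 263604
GENE_LENGTH_BP = 2712
PROTEIN_LENGTH_AA = 903


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2712
print(expected_protein_length(GENE_LENGTH_BP))  # 903
```

Both results agree with the Gene Length and Protein Length fields listed above.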


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.351825
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCTC TACTTTCGCT CTCTGACGTC CGTACCCAGT TCTCGACGGA ACGGGGACAG 
GTGAAGGCGG TTGACGGAAT CGATCTGGAG ATCCGCGAAG GCGAGACCGT TGGACTCGTC
GGCGAATCGG GCTCCGGGAA AAGCGTCACG GCGCTGTCGA CGATGGGGCT CGTCGACGAC
CCCGGCGAGA TCGTCGGCGG AACCGTCGAG CTGACGGACG CCCGAGTTGC GGACGAGCTC
CGCGACCGGT ACGACGCCGC TGAATTCGTC GACGGCGACA CGATCGATCT CACGGCCGCC
CCCGAGGAGG CGCTTCGGTT CGTCCGCGGC CGGGAGATAA GCATGATCTT CCAAGACCCG
ATGACGTCGC TGAACCCTTC AGTGACGGTC GGCGAGCAGG TCGCAGAGAG TCTCAAACTC
CATCAGTACG GCGGGCGTCG CAAGGACTCG TGGTTCAACG CGGTGCGCGA GATCCTCCCG
AAGATCAGCC GCGACGTGGA CGACGAAGTC CGCGAAGAGA CCATTGAGGT GCTTGAAGAG
GTCGGCATCC CGGAACCGGG CGCGCGGATC GACGAGTACC CGCACGAGTT CTCCGGCGGG
ATGCGCCAGC GGGTGCTCAT CGCGATCGCG TTAGCCTGTC AGCCGGGGCT GCTCGTGGCC
GACGAGCCGA CGACCGCGCT GGACGTGACC ATTCAGGCGC AGATCCTCGA CCTGATAGAC
GACCTGCAGG CCGACCTGGG GATGTCGGTG CTGATGATCA CTCACGACCT CGGCGTCGTG
GCCGAGACGT GCGACCGCGT CGCGGTGATG TACGCCGGCG AGATCGTCGA AGAGGGCCCC
GTCGAGGAAA TCTTCGGGAA CCCGTCGCAC CCGTACACGT ACACGCTCCT CGAGTCCCTC
CCGAGCGAGG AGAAAGAGCG CCTCACGCCG ATCGAGGGGA ACGTCCCCGA CCTCATCGAT
ATGCCGTCGG GGTGTCACTT CGCGCCGCGG TGCCCGTGGG CGACCGAGGA GTGTACGAGC
GGCGAGATCC CGTACCTCCA GCACGGCGCC GAGGATGTCG ACCATCGGTC GAAGTGCATC
ATGGAGTCGT TCGACAAGAA CGAGTACGGC GACGACGCTG TCGGCCCCGG GCGCGACCGG
TCGATCGGCG AGCCCCTCGT CGAGATCGAC GGACTCCGGA AGTACTACGA ACAGACGGAC
AGCGTGCTCG ACCGGGTTCT CGGTGCCGAC GACCGCAGCG TGAAGGCGGT CGACGGGATC
GACTTCACGA TCAACCGCGG CGAAACGCTG GGACTCGTCG GCGAATCCGG TTGCGGGAAG
TCGACGGCCG GGCGAGCGCT GTTACACCTC ACCAAACCGA CCGACGGCCG GGTGGTGTTC
GCCGGGACCG ACCTCACGGA ACTCGACGGA TCGGCGCTGC GAGAGCAGCG GAAGAACCTC
CAGATGATCT TCCAAGACCC GCTCTCCTCG CTCGATCCGC GACAGACGGT CGGCCAGACG
ATCCGCGAGC CGCTGTCGAT CCACGACTTG CCCGAGAGCG ACCCGAGCGT GGCGACCGAG
GCCGAGGTGA CCGTCTCCGG GATCGCCCGC GACAGGGTCG GCGTGACGGT CGACGACGAG
ATCGACGCCG TCGTCGGCTC CGGGAGCGGC GTGGCGACGG CCCACGTCGA CGTAACGGTC
GCGGACGGCG AGGTTGACGT CGACGTGCGC GAACACCTCG GGGTCGAGGG GACGGTCGAG
CGCACTGACG CCGGCGACGT GGAACGCGTC TCGGTGACGG TCTCTGCCGG CGACACCGAC
CGGCTCAGGC GGCGGCGTCG CGTCCGACAA CTGCTGGAGG CAGTCGGTCT CGAGGTGGGC
CAGTACGACC GGTACCCCCA CGAGCTGTCC GGCGGCCAGC GCCAGCGCGT CGGCATCGCT
CGGGCACTCG CGGTCGATCC GGAGTTCATC GTCGCCGACG AGCCGGTGTC GGCGCTCGAC
GTGAGCGTGC AGGCGCAGAT CCTCAACCTG ATGGAGGATC TTCAGGACCG GTTCGACCTG
ACGTACCTGT TCATCGCGCA CGATCTCTCG GTGGTGCGCC ACATCAGCGA CCGCGTCGCC
GTGATGTACC TCGGCGAGAT CGTCGAGGTG GCGGCGACCG ACGAGCTGTT CGCGGACCCG
CAACACCCGT ACACGAAGGC GTTGCTCTCC GCGATCCCCG CGCCGGACCC GACGGTGGAC
ACCGACGATC GCGTCATCCT CGAAGGCGAC GTGCCGTCGC CGATCGACCC GCCATCGGGG
TGTCACTTCC GGACCCGGTG TCCGTCGGTG ATCCCCCCCG CCGATCTCGA CATCAAACAG
GAGACGTACC GTGAGGTGAT GAACTACCGA CAGCGGGTCG ATCGACAGGC GATCGACGTC
GAGACGATCC TCGAGGCGGC CGACGAATCG CCCGGACAGG TAGCGGCCGA CGGCGGCACC
GCGTCCGCTG CCCCGTCCGG GGGTGACGCG ATTCCGCCCG GTGCGGTCGA GGCGGTCCGC
GACGTCCAGT TCGATCAGTA CCCGGACGGA CGCGCCGGCG AGGTCGTCGA TCGCTCCCTC
AGACTGGTGA TTGCCGGCGA GTGGGAGGAG GCGGCCGATA TCCTCGAGGA GACGTTCGCC
AGCGTCTGCG AGCGCGAGGA GCCCACGCTC CCCGACGGCG ACCATCCGGC TGCGTGCCAC
CTGATCGACT GA
 
Protein sequence
MPPLLSLSDV RTQFSTERGQ VKAVDGIDLE IREGETVGLV GESGSGKSVT ALSTMGLVDD 
PGEIVGGTVE LTDARVADEL RDRYDAAEFV DGDTIDLTAA PEEALRFVRG REISMIFQDP
MTSLNPSVTV GEQVAESLKL HQYGGRRKDS WFNAVREILP KISRDVDDEV REETIEVLEE
VGIPEPGARI DEYPHEFSGG MRQRVLIAIA LACQPGLLVA DEPTTALDVT IQAQILDLID
DLQADLGMSV LMITHDLGVV AETCDRVAVM YAGEIVEEGP VEEIFGNPSH PYTYTLLESL
PSEEKERLTP IEGNVPDLID MPSGCHFAPR CPWATEECTS GEIPYLQHGA EDVDHRSKCI
MESFDKNEYG DDAVGPGRDR SIGEPLVEID GLRKYYEQTD SVLDRVLGAD DRSVKAVDGI
DFTINRGETL GLVGESGCGK STAGRALLHL TKPTDGRVVF AGTDLTELDG SALREQRKNL
QMIFQDPLSS LDPRQTVGQT IREPLSIHDL PESDPSVATE AEVTVSGIAR DRVGVTVDDE
IDAVVGSGSG VATAHVDVTV ADGEVDVDVR EHLGVEGTVE RTDAGDVERV SVTVSAGDTD
RLRRRRRVRQ LLEAVGLEVG QYDRYPHELS GGQRQRVGIA RALAVDPEFI VADEPVSALD
VSVQAQILNL MEDLQDRFDL TYLFIAHDLS VVRHISDRVA VMYLGEIVEV AATDELFADP
QHPYTKALLS AIPAPDPTVD TDDRVILEGD VPSPIDPPSG CHFRTRCPSV IPPADLDIKQ
ETYREVMNYR QRVDRQAIDV ETILEAADES PGQVAADGGT ASAAPSGGDA IPPGAVEAVR
DVQFDQYPDG RAGEVVDRSL RLVIAGEWEE AADILEETFA SVCEREEPTL PDGDHPAACH
LID
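
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-percentage calculation and a basic check that a coding sequence ends in a stop codon. The helper names are hypothetical, and the GC figure this produces may differ slightly from the rounded value reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (display whitespace ignored)."""
    seq = join_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is TAA, TAG, or TGA."""
    seq = join_blocks(dna)
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# First and last lines of the gene sequence as displayed in this record.
first_line = "ATGCCTCCTC TACTTTCGCT CTCTGACGTC CGTACCCAGT TCTCGACGGA ACGGGGACAG"
last_line = "CTGATCGACT GA"

print(len(join_blocks(first_line)))  # 60
print(join_blocks(last_line)[-3:])   # TGA
```

Passing the full gene sequence to `gc_percent` and `looks_like_complete_cds` gives a quick sanity check against the GC content and Type fields of the record.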