Gene Hlac_0065 details


Gene Information

Locus tag: Hlac_0065
Symbol:
ID: 7401420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 67238
End bp: 68572
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 61%
IMG OID: 643707126
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_002564741
Protein GI: 222478504
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID:
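
The start, end, and length fields above agree with each other under the usual 1-based, inclusive coordinate convention. The following is a minimal Python sketch of that consistency check. The function names are hypothetical, and the sketch assumes that the end coordinate is inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(67238, 68572)
print(length)                  # 1335
print(protein_length(length))  # 444
```

Both values match the Gene Length and Protein Length fields of this record.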


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0310035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG CAGTTGAGCA GTCGAGTCCG ACTGAGGAGT CGGAGGTGCT GGTGGAAGTT 
GAGGGGCTCA AAAAGTACTA CGGAGGCGAC GGACTGTTCG CCGATCCGCC GGTGAAGGCG
GTCGACGGCG TCGACTTCGA GATCAGACGC GGCGAGACGC TCGGACTGGT CGGCGAGTCC
GGGTGCGGGA AGAGCACGCT CGGCCGTACC CTGCTGGCAC TCGAACGCGC CACCGAGGGA
TCGATCGTAT ACAACGGCAC CGACGTCACG ACGCTTTCCG GGACAGAGCT CAAAGAGTGG
CGCAAGAACG CCCAGATGGT GTTCCAAGAC CCCGAGTCCA GCCTGAACGA TCGGATGACG
GTCGGAGAGA TCATTCGGGA GCCGCTCGAC GCACACGACT GGAAGACGAT GAACGATCGG
CGTGAGCGGG TGCTGGATCT ACTGTCGGCA GTCGGTCTCC CCGACAAACA CTACTTTCGG
TACCCACACC AGTTCTCCGG CGGACAGCGG CAGCGGATAG GGATCGCACG AGCGCTCGCG
CTCGAGCCGG ACTTCCTGGT CCTCGACGAA CCGGTTTCTG CCCTCGACGT GAGCGTCCAG
GCGAAGATCA TCAGTCTCCT CGAAGACCTC CAAGAGGAGT TTAATCTCAC GTATCTGCTC
ATCGCACACG ACCTCTCGGT GGTTCGGTAT ATCTCCGATC GTGTCGCCGT GATGTACCTC
GGGAAGATCA TGGAAATGGG CGAGGCCGAA GAGCTGTTCA CAGACGCGTC AAATCCGTAC
ACACAGTCGC TGTTGTCGGC GATTCCGGAA CCCGATCCGA CCGAAACGTC TCGTCGGATA
ACCCTCTCCG GAACGCCCCC GAGCCCGAGC GACGCGCCGC CGGGCTGTAA CCTCTCGACT
CGCTGTCCGG CGAAGATTAA ACCGGAGGCG TACGCCAACC TCGACAGCGA TCTCTGGAAC
GCGATTGAAC AGTTCAGGGA GGTCGTTCGT GAACGCGCCC GTATCACGCT CTCGACGAGC
GACCGGGTCA GGCGGCGGTT CGACCGGTTC GAGCGGTTTG ACGACATCGA GGAGAGCATG
GCGGACACCT TCGATGACCT CGAGGTACCA GAGCGGGTTG ACGAACAGAT CCGGACCGCA
GTCGAGATGG TCAAGCGCGG CCGTCCGACG GAAGCCCAAG AACACCTCTA CGACGAGTTC
GCCAGTGTCT GCGATCGAGA ACCGCCAGAG ATGCACAAAG TGTCTGCGTC AGGTCGGTAC
AGCTACTGCC ACCGGCACAC CGACGAGTAC GAAGACGTGG GGCCCGTGAT TACACGCCGT
GCAGACAGCG AGTAG
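
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python. It assumes the sequence is pasted as text in whitespace-separated 10-base blocks, as displayed here. `gc_percent` is an illustrative helper, not part of any database tool.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first three blocks of the gene sequence are used here for brevity;
# pass the full sequence to compare against the GC content field of the record.
first_blocks = "ATGAGCAACG CAGTTGAGCA GTCGAGTCCG"
print(round(gc_percent(first_blocks), 1))  # 56.7 (17 of 30 bases)
```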
 
Protein sequence
MSNAVEQSSP TEESEVLVEV EGLKKYYGGD GLFADPPVKA VDGVDFEIRR GETLGLVGES 
GCGKSTLGRT LLALERATEG SIVYNGTDVT TLSGTELKEW RKNAQMVFQD PESSLNDRMT
VGEIIREPLD AHDWKTMNDR RERVLDLLSA VGLPDKHYFR YPHQFSGGQR QRIGIARALA
LEPDFLVLDE PVSALDVSVQ AKIISLLEDL QEEFNLTYLL IAHDLSVVRY ISDRVAVMYL
GKIMEMGEAE ELFTDASNPY TQSLLSAIPE PDPTETSRRI TLSGTPPSPS DAPPGCNLST
RCPAKIKPEA YANLDSDLWN AIEQFREVVR ERARITLSTS DRVRRRFDRF ERFDDIEESM
ADTFDDLEVP ERVDEQIRTA VEMVKRGRPT EAQEHLYDEF ASVCDREPPE MHKVSASGRY
SYCHRHTDEY EDVGPVITRR ADSE
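
The protein sequence corresponds to a direct translation of the gene sequence. As a rough illustration, the Python sketch below translates whitespace-separated DNA blocks and stops at the first stop codon. It uses the standard codon assignments, on the assumption that translation table 11 differs from the standard table only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final row of the gene sequence shown above.
print(translate("ATGAGCAACG CAGTTGAGCA GTCGAGTCCG"))  # MSNAVEQSSP
print(translate("GCAGACAGCG AGTAG"))                  # ADSE
```

The two outputs match the first block and the last four residues of the protein sequence in this record.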