Gene Hlac_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0066 
Symbol 
ID7401421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp68569 
End bp69657 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content60% 
IMG OID643707127 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_002564742 
Protein GI222478505 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.869927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.262294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAC CACAGAACAC AACCGCGGAA GCGACGGAGG GCCGTGAAGA TCCGATCCTC 
TCCGTGAGAG ACCTCCAGAC GGTTTTCTAC ACCGATAACG AGACGATCCG AGCGGTCGAC
TCGATCAGTT TCGACGTCGG TCGCGGCGAA ACGGTGGGAA TTGTCGGCGA GTCTGGATCG
GGCAAAAGCG TTACCGCTCG CTCGATCATG GGGTTGATCG AGTCGCCCGG CAAGGTACTA
CCGGGGTCGT CGATCTCCTT CGACGGACAG GAACTCACAG ACCTCTCGGA GAAACAGTAC
CAGTCGATCC GTGGAAGCGG GATCGGGATG GTCTTTCAGG ACCCCCAACA GTCGCTTAAC
CCCGTCTACA CTGTCGGCAA CCAGATCCGT GAATCGCTTG CAATCAACCG CGATATCACG
GGTGAAAAGG CGACCACGGA GGCGACGAGA CTGCTCCGGG CCGTCGGTAT TCCGGATGCC
AAACGCCGGT TGGACGAGTA TCCACACGAG TTCTCCGGCG GGATGCGTCA GCGTGCCGTC
ATCGCGATGA TGCTCGCGTG TGACCCGGAC TTTCTGATCT GTGACGAACC GACGACTGCA
CTCGACGTGA CGATCCAGGC ACAGATCCTG GAACTCTTAA AAGAGCTCCA AGAGGAACGC
GGCCTGTCGA TACTGTTCAT CACGCACGAC ATGGGTGTCG TCGCGGACAT CGCAAACCGT
GTGAACGTGA TGTACGCGGG ACAGATTGTC GAGAAGGCGA CCGTCGAAGA TCTGTTCGCG
AACCCACAAC ACCCGTACAC GAGAGCGCTG CTCAGGTCCA TTCCCGGTGA ACACTCCCGT
GCGGACGGGC TCGAAACAAT CGAGGGGCAG GTGCCGACAC CGAACGAACC GGCCGATCAC
TGTCGGTTCG CGCCTCGGTG TTCGAAGGCG TTCGACGCCT GCGAGAACGT GCCACCACAG
CACGTTCCTG TCGGGGACAG CGAGGATCAC ACGGCGTCGT GTCTGTTGTA CCCCGACGAC
CTGAGTGAGA CGGCGCGCGT TGAGAGACAC GTCGAGAGAG CGACCGAACG AACGGAGGAA
CACTCATGA
 
Protein sequence
MSQPQNTTAE ATEGREDPIL SVRDLQTVFY TDNETIRAVD SISFDVGRGE TVGIVGESGS 
GKSVTARSIM GLIESPGKVL PGSSISFDGQ ELTDLSEKQY QSIRGSGIGM VFQDPQQSLN
PVYTVGNQIR ESLAINRDIT GEKATTEATR LLRAVGIPDA KRRLDEYPHE FSGGMRQRAV
IAMMLACDPD FLICDEPTTA LDVTIQAQIL ELLKELQEER GLSILFITHD MGVVADIANR
VNVMYAGQIV EKATVEDLFA NPQHPYTRAL LRSIPGEHSR ADGLETIEGQ VPTPNEPADH
CRFAPRCSKA FDACENVPPQ HVPVGDSEDH TASCLLYPDD LSETARVERH VERATERTEE
HS