Gene Hlac_0472 details

Gene Information

Locus tag: Hlac_0472
Symbol:
ID: 7400352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 490987
End bp: 492156
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 68%
IMG OID: 643707536
Product: ABC transporter related
Protein accession: YP_002565144
Protein GI: 222478907
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3842] ABC-type spermidine/putrescine transport systems, ATPase components
TIGRFAM ID:
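
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of this record or any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(490987, 492156)
print(length)                     # 1170
print(protein_length_aa(length))  # 389
```

Both results agree with the Gene Length and Protein Length fields reported for Hlac_0472.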


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.745799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.301247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACA CGCAATCAGT CAAACCGACA CAGTCGGCTG GGGACGCAAG CGAGTCGCCC 
GCAGAAGCGA ACGAATCGCA CGCCGGCGCG ACCGAGTCGG CCACCGACGC GTCGCCGGAT
CCGGTGTTGT CGCTGTCGGG GGTGACGAAG GAATTCGGCC CGGAGACCGC CGTCGACGGC
GTGTCACTCG ACGTCAATCC GGGAGAGTTG CTCACCTTCC TCGGGCCGTC CGGCTGCGGG
AAGACGACGA CGCTCCGGAC CATTGCCGGG TTAGAGGAAC CGACCGAGGG ACAGATTACG
CTCGGCGACG AGACCGTCTC CGGCGACGGT GCGTTCGTCC CCCCCGAACA GCGCGATGTC
GGGATCGTCT TCCAGAACTT CGCGCTGTTT CCCCACCTCA CCGTCCGGGA GAACATCGCG
TTCGGGCTCG GCGACGGCGC GGACGCGACC GCAGACCGCG TCGACGAGAT GCTCGACCTC
GTCGATCTGC CCGAACACGG CGAGAAGACG CCGGACCAGC TCTCCGGTGG GCAGAAACAA
CGCGTCGCAC TCGCGCGGTC GCTCGCGCCG GAGCCCGACG TGCTCCTCCT CGACGAGCCG
TTCTCGAACC TCGACGTGCG GCTCCGCGTC GAGATGCGCG AGGAAGTCCG CCAGATCCTG
AAAGCGGCCG GCGTCACCGC GGTTTCCGTC ACCCACGACC AGGAGGAGGC GCTCTCCATC
TCTGACCGCG TCGCCGTGAT GAACGAGGGA CAGATCGAGC AGGTCGGCCG CCCCGAATCG
GTGTTCGAGC GCCCCGAGTC GAAGTTCGTC GCCTCCTTCC TCGGGCGGGC GTCGTTCCTC
GAAGGGCACC TCCGCGACGG GAAAGTCGAG ACCGGAATCG GCCGGTTCGA CGCCGTCACG
CTGGAGGGCT ACGACACCGT CTACGACGGG ACGCCAGTCG ACGTGCTCGT GCGCCCCGAC
GACCTACGGG CGAGCCCCGC GAGCGCCGAG CTGGCCGACG GGACCATCGT CTCCCGGCAG
TACGTCGGTC CCTCGTTCAT CTACCGGGTC GAACTCGAGT CCGGGGACGT CGTCCACTGC
CTCCACAACC ACGTCGAGGA GTTCGACCTC GACGAATCGG TGGGCTTGGA GCTCACCGCC
GATCACCCGC TGGCCTGGTA CCCGCGGTAG
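
The GC content field for this gene can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases among all A, C, G, and T bases; the record does not state the exact formula or rounding used. The function name `gc_content` is hypothetical. Whitespace between the 10-base blocks is ignored.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as used in the 10-base block layout) are skipped.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence: 2 G + 3 C out of 10 bases.
print(gc_content("ATGGCTCACA"))  # 0.5
```

Applying the function to the full 1170 bp sequence and rounding to a whole percentage gives a figure that can be compared with the 68% reported in the Gene Information section.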
 
Protein sequence
MAHTQSVKPT QSAGDASESP AEANESHAGA TESATDASPD PVLSLSGVTK EFGPETAVDG 
VSLDVNPGEL LTFLGPSGCG KTTTLRTIAG LEEPTEGQIT LGDETVSGDG AFVPPEQRDV
GIVFQNFALF PHLTVRENIA FGLGDGADAT ADRVDEMLDL VDLPEHGEKT PDQLSGGQKQ
RVALARSLAP EPDVLLLDEP FSNLDVRLRV EMREEVRQIL KAAGVTAVSV THDQEEALSI
SDRVAVMNEG QIEQVGRPES VFERPESKFV ASFLGRASFL EGHLRDGKVE TGIGRFDAVT
LEGYDTVYDG TPVDVLVRPD DLRASPASAE LADGTIVSRQ YVGPSFIYRV ELESGDVVHC
LHNHVEEFDL DESVGLELTA DHPLAWYPR
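
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it, under these assumptions: the codon assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG), and translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCTCACA CGCAATCAGT CAAACCGACA"))  # MAHTQSVKPT
# Last three 10-base blocks, ending in the TAG stop codon.
print(translate("GATCACCCGC TGGCCTGGTA CCCGCGGTAG"))  # DHPLAWYPR
```

The two outputs match the first and last residues of the protein sequence listed above.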