Gene Hlac_0229 details

Gene Information

Locus tag: Hlac_0229
Symbol:
ID: 7402158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 247443
End bp: 248768
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 65%
IMG OID: 643707292
Product: Extracellular ligand-binding receptor
Protein accession: YP_002564904
Protein GI: 222478667
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
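
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; the record states neither, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 247443
END_BP = 248768
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1326
print(expected_protein_length(GENE_LENGTH_BP))  # 441
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.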


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00108704
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.192891
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAACTG CAAGTCCGGA TGAAGAAAAC AAGCGAAACC GAACCAGAGA TATCGAGACG 
AATTCGGAAA GCCGCTCGCC GGTCGGTCGG CGACGATTCC TGCAGGCCGC CGGCGCCGGT
GCGACCGTCG CGTTGGCCGG CTGCTCGGGC GGGAGCGGCG ACGACGACGT GGTTCGGCTG
GGGGCGACGT ACATTCTCTC CGGATTCGCC TCGCTGTACG GCGAGGCCGC CGAGCGGGGC
CTCGAAATGG CCCAGTCGGA GATCAACGAC AACGGCGGCA TCGACGGACG CGACGTGGAG
GTGACCGTCC GAGACACCGA GGCGAGCGCC GATACGGCGA TCCAGCAGAT GCGGAGCCTC
GTCGAGGAGG ATAACGTCCA CGGGTTGTTC GGGCTAGACT CCAGCGGCGT CGCACAGGCC
GTGGCGCCGC AGGCGGCACA GCTCCAGATG CCGTACATGA TCACGCACGC GGCAACTCCG
TTCGTCACCT CGCCCGAAGG CGAACACGAA GACTCCGTCG GCAACGACTA CGTGTTCCGC
GACTCCAACA GCCTCTCGCA GGATATCTAC GGGGCGGCAC TGACCGCCGC CGAACTCGAC
GCCACGGAGT GGGCGACGAT CGGCCCCGAC TACGCGTTCG GCTACGAGAC GTGGGACTAC
TTCCAAGACT TCTGTGCGGG TCTCGGTGTC GACGCCGAGT TCACCGCCGA ACAGTTCCCG
CCGCTGGAGA CGGGCGATTA CACGCCGTAC ATCAGTTCGA TCCTCGACGC CGAGCCGGAC
GCCGTGCTCA CGCCGCTATG GGGTGCCGAC CTCACCACGT TCATCGGACA GGCAGAGGAT
GCGGGCTGGT TCGATCAGAT CGACCACACG CTGTTCAGCG TCGGGATGGG AACCGATCTC
GCGAGCGACG GGAACCCGCT CCCGGAAGGC GAGTACGCCT CGACTCGGTA CGACCCGTTC
GTCCCCGACA CCGAGGCAAA TAACACGTTC CGCGACACGT ACTACGAGGA ATACGACGCG
CTCCCGACGT ACAACGCTGA GGGCGCGTAC CGCGCCGTCT ACCTGTACAA GGAAGCGATC
GAATCGGCCG GAAGCACCGA AGCCAGCGCG CTCGTCGAGG AGTTCACCGG GATGGAACAC
GCCGGTCCGG TCGGTGACTA CCAGTTCAGC GAGACGAATC AGGCGACGGT GTCGTCCATC
TGGGGAACGG TCTCGTACGA CGAGGAATGG GAGAGCAACG TGCTCGATCC CGTGAACAGG
TACGAGGCCG GCCCCGACGA GCTGTCCGAG GCGCTCGCCG ACTCCGATCT GCCGACGGGC
ATCTAA
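
The listing above is grouped into 10-base blocks, so it needs its whitespace removed before any computation. As a minimal sketch, a GC fraction such as the one reported in the GC content field could be computed as follows. The helper names are hypothetical, and the method the source database used is not stated.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks that group the listing into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First block of the gene sequence above: 3 G + 1 C out of 10 bases.
print(gc_fraction("ATGGTAACTG"))  # 0.4
```

Passing the full listing, spaces and line breaks included, to `gc_fraction` gives the value for the whole gene.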
 
Protein sequence
MVTASPDEEN KRNRTRDIET NSESRSPVGR RRFLQAAGAG ATVALAGCSG GSGDDDVVRL 
GATYILSGFA SLYGEAAERG LEMAQSEIND NGGIDGRDVE VTVRDTEASA DTAIQQMRSL
VEEDNVHGLF GLDSSGVAQA VAPQAAQLQM PYMITHAATP FVTSPEGEHE DSVGNDYVFR
DSNSLSQDIY GAALTAAELD ATEWATIGPD YAFGYETWDY FQDFCAGLGV DAEFTAEQFP
PLETGDYTPY ISSILDAEPD AVLTPLWGAD LTTFIGQAED AGWFDQIDHT LFSVGMGTDL
ASDGNPLPEG EYASTRYDPF VPDTEANNTF RDTYYEEYDA LPTYNAEGAY RAVYLYKEAI
ESAGSTEASA LVEEFTGMEH AGPVGDYQFS ETNQATVSSI WGTVSYDEEW ESNVLDPVNR
YEAGPDELSE ALADSDLPTG I
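
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares; table 11 differs only in which codons may act as start codons. The `translate` helper is illustrative and is not the tool that produced this record.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide string, ignoring grouping spaces and line breaks."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and final 6 bases of the gene sequence listed above.
print(translate("ATGGTAACTG CAAGTCCGGA TGAAGAAAAC"))  # MVTASPDEEN
print(translate("ATCTAA"))                            # I*
```

The two outputs match the first ten residues and the final residue of the protein sequence; the trailing `*` is the stop codon, which is not counted in the protein length.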