Gene Hlac_1149 details

Gene Information

Locus tag: Hlac_1149
Symbol:
ID: 7400958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1156156
End bp: 1157484
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 66%
IMG OID: 643708214
Product: Extracellular ligand-binding receptor
Protein accession: YP_002565813
Protein GI: 222479576
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
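
The lengths listed above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon (the record does not state either explicitly). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
start_bp, end_bp = 1156156, 1157484

length_bp = gene_length_bp(start_bp, end_bp)
length_aa = protein_length_aa(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.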


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACCAA ATAGCATGGA CTTGGTCGAC CGGAGAACGC TGCTGAAACT GACGGGAGGA 
GCGGGCGTTG GCGCGCTCGC GGGCTGTCTC AGCACGACCG ACGACGGCGA AGACGGATCC
GACGGGAGCG ACGGGAGCGA CGGGAGTGAC GGGAGCGATG GCGAGAACGG AACCGACGGC
GACGACGGGA GCGACGACGG AGGAGACTCG ACCGACGCCT ACGAGATCGG GATGGTCGAC
TCGCAGACCG GGTCGCTGTC GGCGTTCGGC GAGCGGAACC AGCGCGGCGT CAACCTCGCC
TTACAGCGCG TCAACGAGAT CGGCATCGAC GGCCGCGACC TCGAGATCAT CGTCGAAGAC
TCCGAGAGCG AGAACCAAGG CGGGATCGCC GCCGCCCAGA AGCTCGTCAA CCAGGACGGC
GTGCCCTTCC TCATCGGCGC AGTCGGCTCC GGCGTCTCGC TGGCAATCTA CGAGAGCGTC
GTGGAGGGGA CGGACGTCGT CCAGCTAAGC CAGAACTCCA CGGGGCTCAA CCTCACGGAT
TTCCCGGGGC TGCTCCGGAT GTCACCGTCG GGCCGCAGCC AGTCGCTCGC GCTGTCGAAC
CTCATCACTG ACGACGGCTA CGACGAGGTG GCGATCACCT ACGTCAACAA CGACTACGGC
CAGAGCCTCA CCGACGCGTT CGTCGACGCG TACGACGGCG AGGTCGTCTA CAACAGCCCG
CACGACCAGG AGCAGCAGTC CTACTCGGGA GTCATCTCCG AGATGAACAG CTCGGGCGCC
GACGCGTGGC TGTTCATCAC CTACCAGGCC GAGTTCGCGA CGATGGTCAA CGAGGTGTAC
TCGTCGGGCT ACGAGGCGCA GTTCTACGGC GCCGACTCCG TCTCCGGCGA CAACGTCCTC
GAGAACACGC CGGAGGGAAG TATCGACGGC ATGAAGATCG TGGTCCCCTC CGCGCCGATC
GAGGAGGAGA ACTACCAGTC GTTCGCGTCG GACTTCGAGG AAGAGTACGG CCGGCAGCCG
ACCTCGTGGG CCGCGTACGC GTACGACTGC GCGATCAACG CCGCGCTCGC GATCCAGGCC
GCCGACGAGT TCACCGGCGC GGCGCTTCAG GAGACCGTCC GGCGTGTCTC CGGCCCCGAA
GGGGAGGAAG CGACCTCCTT CGAGGCCGCC AGCCAGATCC TCGCAGACGG CGGCGGTCCC
GACGACGTCG ATTACCAAGG GGTCAGCGGT CCCATCGACT TCGACGAGAA CGGGGACCCG
GTCGGTTTCC TTCAGGTCTT GGAGGTCCAA GACCACGCGT ACGAAGGTAT CGACTTCATC
GAAGGCTGA
 
Protein sequence
MSPNSMDLVD RRTLLKLTGG AGVGALAGCL STTDDGEDGS DGSDGSDGSD GSDGENGTDG 
DDGSDDGGDS TDAYEIGMVD SQTGSLSAFG ERNQRGVNLA LQRVNEIGID GRDLEIIVED
SESENQGGIA AAQKLVNQDG VPFLIGAVGS GVSLAIYESV VEGTDVVQLS QNSTGLNLTD
FPGLLRMSPS GRSQSLALSN LITDDGYDEV AITYVNNDYG QSLTDAFVDA YDGEVVYNSP
HDQEQQSYSG VISEMNSSGA DAWLFITYQA EFATMVNEVY SSGYEAQFYG ADSVSGDNVL
ENTPEGSIDG MKIVVPSAPI EEENYQSFAS DFEEEYGRQP TSWAAYAYDC AINAALAIQA
ADEFTGAALQ ETVRRVSGPE GEEATSFEAA SQILADGGGP DDVDYQGVSG PIDFDENGDP
VGFLQVLEVQ DHAYEGIDFI EG
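
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores alternative start codons. Only the first 60 bases of the gene sequence are used here to keep the example short; the helper names `clean`, `translate` and `gc_content` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "ATGTCACCAA ATAGCATGGA CTTGGTCGAC CGGAGAACGC TGCTGAAACT GACGGGAGGA"

print(translate(first_line))
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

For the first 60 bases this yields MSPNSMDLVDRRTLLKLTGG, matching the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a figure that can be compared with the GC content field.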