Gene Hlac_0851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0851 
Symbol 
ID7400817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp844205 
End bp845218 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content68% 
IMG OID643707917 
Productperiplasmic solute binding protein 
Protein accessionYP_002565520 
Protein GI222479283 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCACT CACGGCGGTC GGTGCTGCGT CGCGGCGCCG GTCTCGCGGT CGCGGGAACG 
GCGGCGTCGT TAGCCGGCTG TTCCGGCACC ACAAACGGCG GATCCGGTGG GTTCGACGCC
GGCTACGCCG CCTTCTTCAC CCTCAATGAC TGGGCGAATC AGGTCGCGGG CGACCACGCG
AGCTTCGAGG ACCCGGTCGA CGTGGGGCAG CTCGGTCACG GCTGGACGCC GGACGGGAAC
CTCGCTGTAG ACGTCGCCTC CACCGACGCG TTCGTCTACC TCGACAGCTC GGAGTTCTCG
TGGGCGCAGG ATCTGGCCGC GACGCTGGAG GACGATTACG ACACGGTCGC CGTGATCGAC
GGGCTCGCCG GGCTGGAAGA GGACCTCCTT GAGTGGGACC ATAGCCACGA CGAAGAGGAG
GAAGACGCCC ACGACGACGA AGACAGCCCC GACGACGAAG ACGGCCCCGA CAGAGGGCAG
TACGACCCCC ATGTCTGGGT CGATCCGGTG CTTGCCGCCG ATGTCGTCGA CACCATCGCG
GCAGGGCTCG GCGAGGCGGA CCCGGACAAC GCCGACGACT ACGCCGACAA CGCCGCCGCC
TACGCCGAGG ATCTCGACGC GATCGACGAT GCCTTCGAGT CAATCGCCGA GAACGCCGAG
CGCGGCGTGG CGGTCATGGC GGGCCACAAC TCCTTTCAGT ACCTAGAGGC GCGCTACGGG
TTCCGGCTCC ACTCGCCGGT CGGCGTCTCG CCGCAAAACG AGCCGACGCA AAGCGAGATC
GCCGACACGA TCGAACTCGT GAACACGGAG GGGATCGACG CGGTGTTGTA CGACCGCTTC
GAGTCGCCCA GGCTCGCCGA GTCGATCGTC GAGAACAGCG ATGCCACCGA GGCGGTCCCC
GTCACGCCGG CCGGGGGGAC GACCCGTGAG TGGAACGACG CCGGGTACGG CTATCTCGAA
CAGATGACCG AGATCAACGT CCCCGCCTTC GAGCGGGCAT TCGACGCGCA GTGA
 
Protein sequence
MTHSRRSVLR RGAGLAVAGT AASLAGCSGT TNGGSGGFDA GYAAFFTLND WANQVAGDHA 
SFEDPVDVGQ LGHGWTPDGN LAVDVASTDA FVYLDSSEFS WAQDLAATLE DDYDTVAVID
GLAGLEEDLL EWDHSHDEEE EDAHDDEDSP DDEDGPDRGQ YDPHVWVDPV LAADVVDTIA
AGLGEADPDN ADDYADNAAA YAEDLDAIDD AFESIAENAE RGVAVMAGHN SFQYLEARYG
FRLHSPVGVS PQNEPTQSEI ADTIELVNTE GIDAVLYDRF ESPRLAESIV ENSDATEAVP
VTPAGGTTRE WNDAGYGYLE QMTEINVPAF ERAFDAQ