Gene Hlac_1417 details


Gene Information

Locus tag: Hlac_1417
Symbol:
ID: 7400736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1429166
End bp: 1430335
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 66%
IMG OID: 643708478
Product: basic membrane lipoprotein
Protein accession: YP_002566075
Protein GI: 222479838
COG category: [R] General function prediction only
COG ID: [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
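
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1429166
end_bp = 1430335
gene_length_bp = 1170
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
residues = codons - 1

print(span, residues, remainder)  # 1170 389 0
```

Under those assumptions the span equals the listed gene length and the residue count equals the listed protein length.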


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCCG ACTTCGATCG GCGGCGGTTC CTCAAGGCAG CAAGCGCGGC GGGCCTCGTC 
GGCCTCGCCG GCTGTAGTGG CGGACCGAGC GGCGACGGGT CCGACGGCTC CGACGGTAAC
GACAGCTCCG ACGGAGACGA CGGGTCTGAC GGCTCGGACG GGAGCGACGG ATCGGACGGC
GAGGACGGGT CCGACGGAGA CGACGTGCCG GCGACGGTCG GCATCGTCTA TTCCGACGGC
GGCCTCGGCG ACAACTCGTT CAACGACGCG GCCCAGCAGG GAATTCTCCA GGCCGAAGAG
GAGTTCGGTA TTGAGTACGA CGAGTCGGAG CCGGACGGTG CCGGCGAGTT CGGGCAGTTC
CAGCAGCTGT ACGCGAGTTC GACGGATCCA GAATACGATC TGGTCTCCTG TATCGGGTTC
AACCAGGGGG ACGCGCTCAC CGAGACTGCA CCCCAGTACC CCGATCAGGA CTTCATGATC
GTCGACACCG TGGTCGACGA GCCGAACGTC GCGAACTACT TGTTCCGTGA GCAAGAGGGC
TCGTTCCTGA TGGGCGTGCT CGCCGGACGC CTGACCGAGA CGGAGTTCTC CGCGGGCGCG
GGATCGACCG ATCCCGACTC GACGACCGTT GGGTTCGTCG GCGGCGTCGA CAGCCCGGTC
ATCCGCCGGT TCCAAGCCGG CTTCGAGGCC GGCGTCGACT ACGCGTCCGA CAATGTCGAC
GTTACCACGA GCTACGTCGG CAGCTACGCC GACCCTTCGG GCGGGCAGGA GGCCGCACTG
TCGATGTACC AAAATGGGGT GGATATCGTC TATCACGCTT CGGGCGCGAC GGGCGTCGGC
GTGTTCCAGG CCGCACAGGC GGAAGGCCGG TTCGCGTTCG GTGTCGACCA GGACCAGTCG
ATCTCCAACG AGTCGTTCGC GGACGTGATC CTGGCGTCGA TGGTGAAGCG CGTCGACACC
GCGGTGTACG AGTCGATCTC GAACATCATC GACGACAACC ATCAGGGTGG TGAGACGCTC
GCGCTCGGGC TGGAGTCGAA CGGTGTCGAG TGCGTCTACG GCGACCAGAT CGGCGGCGAA
GTGCCCGACG ATATCGTGTC CGCCGTCTCC GACGCGCGCG ACGAGATCAT CGCCGGAAAC
ATCGACGTGC CGGAGACCAC GAGCGACTAA
 
Protein sequence
MASDFDRRRF LKAASAAGLV GLAGCSGGPS GDGSDGSDGN DSSDGDDGSD GSDGSDGSDG 
EDGSDGDDVP ATVGIVYSDG GLGDNSFNDA AQQGILQAEE EFGIEYDESE PDGAGEFGQF
QQLYASSTDP EYDLVSCIGF NQGDALTETA PQYPDQDFMI VDTVVDEPNV ANYLFREQEG
SFLMGVLAGR LTETEFSAGA GSTDPDSTTV GFVGGVDSPV IRRFQAGFEA GVDYASDNVD
VTTSYVGSYA DPSGGQEAAL SMYQNGVDIV YHASGATGVG VFQAAQAEGR FAFGVDQDQS
ISNESFADVI LASMVKRVDT AVYESISNII DDNHQGGETL ALGLESNGVE CVYGDQIGGE
VPDDIVSAVS DARDEIIAGN IDVPETTSD
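
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool that produced this record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard set of amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may serve as start codons). The example uses only the first displayed line of the gene sequence.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments; it differs from the
# standard table only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spaces and line breaks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (the sequence must not be empty)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence (60 bases, 20 codons).
first_line = "ATGGCATCCG ACTTCGATCG GCGGCGGTTC CTCAAGGCAG CAAGCGCGGC GGGCCTCGTC"
print(translate(first_line))  # MASDFDRRRFLKAASAAGLV
print(f"{gc_fraction(first_line):.2f}")
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full gene sequence to the same functions gives values that can be compared with the listed protein sequence and GC content.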