Gene Hlac_2697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2697 
Symbol 
ID7400904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2688799 
End bp2690094 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content69% 
IMG OID643709771 
ProductABC transporter, periplasmic binding protein, thiB subfamily 
Protein accessionYP_002567338 
Protein GI222481101 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAACG ACGACGCGAC CGAGACGGAC CGGCGTGACG CCCCCGACGA CCGGCAGGCG 
GACTCTGGCG GCGGCGCCGG CAGACGCACC GATCGACGTA CCGGGGTCCC CACCCGCCGG
CGGTTCCTCG CGCTCGGCGG CGCGGCGGGC GCCGTCGCGC TCGCGGGGTG TAGTGCCGAG
CCGACGGACG GTGAGGACGG AGACGGAGAA AACGGGACCG CCGGTGATGA CAGCGCCGAC
GGGCCGGGGT CCAGCGACGG TGACGATGGC GACGGCGACG ACGAGGAGAC CCCTACCCTG
ACGGTCGCGA CCTACAGTAG CTTCATCGAC GCGCCGTCGG TGAGTCCCGG TGAGTGGCTC
AAGGAGGCGT TCGAGTCGCG CGTCGACGCC GAACTGGAGT GGGCGACCCC GGACAACGAG
GTGAACTACT ACGTCGAGCG GGCCAGTTCG GGGGTGTCGA TCGACGCCGA CCTCTACGTC
GGGCTCACCA CCGAAGACTT GGTGCGCGTC GACGAGACGC TCGACGACGA CCTGTTCGTC
GAGCGCGGCG AGGTCGAGGG ATTCGACGAC GTGCGGGAGG GGCTGTTGTT CGACCCGTTC
GACCGCGCGG TTCCCTTCGA CACCGGCTAC GTGAGCCTCG TGTACGACGG CACCGCGATC
GAGGCGCCGG AGACGTTCGA GGGCCTGCTG GATGACGAGC ACGCAGGCGC GCTCATCGCG
CAGAATCCCG GCGCCTCGAC GACGGGGCGG TCGTTCCTGC TCCACACGGT CCACCGGTTC
GGCGACGGGC CGGACGGGTC GGTGGAGGGC GGCGACGGCG ACCCCGACTA CGACTACCTC
GACTACTGGG CGGAGCTACA GGACAACGAC GTGCGTGTGC TCGGCTCGTG GGACGACGCC
TACGCCGCCT GGAGCGGGGG GGAGGCCCCG ATGGTCGTCT CCTACTCGAC CGATCAGGTG
TTCGCGAGCA TGGAGGGGGC AGACTTGGAG AAACACCAAA TTCGGTTCCT GAACGATCAG
GCGTACGCCA ACCCGGAGGG GATGGCCGTC TTCGCCGACG CAGACGAGCC GGAGCTCGCC
CGCGAGTTCA TGTCGTTCAT GCTGGAGCCG GACGTGCAGG GGGTTATCGC CGAGCGCAAC
GTCGCGTTCC CCGCGACCGA CACGGCCGAG CTCCCCGACG ACTACGCCGA ACTGGCGCAG
GAGCCGTCCG AACCGGTGAC GTTCACGTAC GACGAGCTCC AAGGCTCGGT CAGTGAGTGG
GTCGAAGACT GGGAGCGACA GTACGCCGGG AACTGA
 
Protein sequence
MTNDDATETD RRDAPDDRQA DSGGGAGRRT DRRTGVPTRR RFLALGGAAG AVALAGCSAE 
PTDGEDGDGE NGTAGDDSAD GPGSSDGDDG DGDDEETPTL TVATYSSFID APSVSPGEWL
KEAFESRVDA ELEWATPDNE VNYYVERASS GVSIDADLYV GLTTEDLVRV DETLDDDLFV
ERGEVEGFDD VREGLLFDPF DRAVPFDTGY VSLVYDGTAI EAPETFEGLL DDEHAGALIA
QNPGASTTGR SFLLHTVHRF GDGPDGSVEG GDGDPDYDYL DYWAELQDND VRVLGSWDDA
YAAWSGGEAP MVVSYSTDQV FASMEGADLE KHQIRFLNDQ AYANPEGMAV FADADEPELA
REFMSFMLEP DVQGVIAERN VAFPATDTAE LPDDYAELAQ EPSEPVTFTY DELQGSVSEW
VEDWERQYAG N