Gene Hlac_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3454 
Symbol 
ID7402300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp202124 
End bp203047 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content67% 
IMG OID643709995 
Productmolybdate transport protein 
Protein accessionYP_002567561 
Protein GI222481325 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2998] ABC-type tungstate transport system, permease component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATAC AACGCCGCCA GTTCGTCGTG GCTATCGGCG CCGGGGGAGT CGCGACGGGC 
CTCGCCGGGT GTTCGCAGGT GGTCGGGAGC GACGGCCAGC CGGCGGTGGC GGGTGAAACC
CTGACGCTCA CGACGACGAC GAGCACCTAC GACACCGGAC TGCTCGACGA CATCCACCCC
GACTTCGAGG ACATGTACGG GGTGTCCGTC GACGCGGTCG CACAAGGAAC CGGTGCCGCC
CTCCAGTCCG CGCGCAACGG CGACGCCGAC GTCGTGATGG TCCACGCCCG CGGCCTCGAG
GACGGGTTCA TGCGCAACGG GCACGGCATC AACCGCCGCG ACCTCATGTT CAACGACTTC
GTCGTCGTCG GCCCGGAGAG CGACCCGGCA GGCATCGAGG GTTCGAGTTC GGCGACCGAG
GTGCTCGATG CCATCGCCGA CGCCGAAGCG ACGTTCGTCT CCCGGGGCGA CAACTCGGGA
ACCCACACGA AAGAGCTCGA CCTCTGGGAC GCCACGGACG CCGAACCGGG CGGGGACTGG
TACCAGGAGA CCGGGACCGG GATGGGCCAG GCGCTGAACG TCGCGGCCCA GCAGGGCGCG
TACACGCTCT CCGATCGCGG GACGTTCATC TCCCAGCGCG GCCAGATCGA CCTCGCGATC
CTGGTACAGG GCCCAATCGA GGGTGGTCCC GAGATCCTCG CGAACCCCTA CGGCATCATG
GCAGTCAATC CAGGGAAACA CGAGAACGCC AACTACGACC TCGCGATGGC GTACATCGGC
TGGATAACCA GCCCCGGTGC CCAGGACGCC ATCTCGGGGT ACCAGGTGAA CGGCGAACAG
TTGTTCTTCC CCGAGGCCGT TTCCGAAGAC CCCAACTTCC AGCAGTACGT CCCCGACGGG
TGGAGCGACG ACTCCAACGA CTGA
 
Protein sequence
MPIQRRQFVV AIGAGGVATG LAGCSQVVGS DGQPAVAGET LTLTTTTSTY DTGLLDDIHP 
DFEDMYGVSV DAVAQGTGAA LQSARNGDAD VVMVHARGLE DGFMRNGHGI NRRDLMFNDF
VVVGPESDPA GIEGSSSATE VLDAIADAEA TFVSRGDNSG THTKELDLWD ATDAEPGGDW
YQETGTGMGQ ALNVAAQQGA YTLSDRGTFI SQRGQIDLAI LVQGPIEGGP EILANPYGIM
AVNPGKHENA NYDLAMAYIG WITSPGAQDA ISGYQVNGEQ LFFPEAVSED PNFQQYVPDG
WSDDSND