Gene Hlac_1538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1538 
Symbol 
ID7401468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1560335 
End bp1561555 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID643708604 
Productprotein of unknown function DUF405 
Protein accessionYP_002566196 
Protein GI222479959 
COG category[S] Function unknown 
COG ID[COG2311] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCGCG ACGCCGGCCC GACGCCGCCC TCCGAACGCA TCGTCGCCCT CGACGCCCTC 
CGCGGGGTCG CCCTGCTCGG CATCCTCCTG ATCAACGTCT GGGCGTTCGC GATGCCCGAG
ACGACGCTGT TTAATCCGAC GGTGTACGCC GACACCACCG TTTACGGCGA CTTCACGGGG
GCGAACTACT GGGCGTGGGC GTTCAGCCAC GTGTTCGCAC AGAACAAGTT CATCACGCTG
TTCTCGGCGC TGTTCGGTGC GGGAATCCTA CTGTTCATCG AGAGCAAAGA GGAGAAGGGG
CAAGACGCGG TGCGGCTCCA CTATCGCCGG ACCGCGATTC TCATCGCGAT CGGGCTCATG
CACGCGTATC TGCTGTGGTA CGGCGACATC CTCGTCGCGT ACGGGGTGAC CGCGCTGGTC
GTCGTCGCGT TCCGGAATCT CGAAGCCCGG AAGCTCGCTG GGGTCGGCGT GGTCTTCCTG
CTGTTCCTCC CCGTGGTCGA ACTGTTCGCC GCGATCACTC TCGGCGGCGA CGCGATCGCA
TCGCAGTGGG CACCGGCGGA AGCCGCGATC GAACAGCAGG TCGCGACGTA CCGCGGCGGC
TGGCTCGAAC AGCTCGACCA CCGGGTCCCA TCCTCGTTCA GCCGACAGAC GACCGGCTAC
ATCAACGGGC CATTCTGGCA GGTCGGTGGC ACCATGCTCC TCGGGATGGC GCTGTACCGC
TGGGGCGTGT TGACCGGCGA GCGGTCGTCG GCCCTGTACC GCCGGCTCGT TGCGCTCGGT
GTTGTCGGCC TCGCGATCAC CGTCGCCGGC GTCGTCTACA TCGAGGCCAA CGACTGGAGC
GCCGGCGCCG CGCTGTACTG GCGGCAGTTC ATCTACGTCG GCAGCTTCCC CCTCGCCGGC
GGCTACCTCG GGATCGTGAT GCTGTACGCC CGCCGGCGCC CGGACGGCCC CGTGACTCGC
GGCCTCGCCG CGGTCGGGCG GACCGCGTTC ACGAACTACC TCCTGCAGAC GGTGATCGCG
ACCACCGTCT TCTACGGCCA CGGCCTCGGG CTGTTCGGCT CCGTTACCCG CGTCGAGCAG
CTCGGGTTCG TTCTCGTCGT TTGGGTGGTG CAGATCGTTT TGTCAGTCCT GTGGCTGCGA
TCTTTCCGGT TCGGTCCCGT CGAGTGGATC TGGCGGACGC TTACGTACGG GGAGCGACAG
CCGATACGAA ACCCGGAGTA G
 
Protein sequence
MTRDAGPTPP SERIVALDAL RGVALLGILL INVWAFAMPE TTLFNPTVYA DTTVYGDFTG 
ANYWAWAFSH VFAQNKFITL FSALFGAGIL LFIESKEEKG QDAVRLHYRR TAILIAIGLM
HAYLLWYGDI LVAYGVTALV VVAFRNLEAR KLAGVGVVFL LFLPVVELFA AITLGGDAIA
SQWAPAEAAI EQQVATYRGG WLEQLDHRVP SSFSRQTTGY INGPFWQVGG TMLLGMALYR
WGVLTGERSS ALYRRLVALG VVGLAITVAG VVYIEANDWS AGAALYWRQF IYVGSFPLAG
GYLGIVMLYA RRRPDGPVTR GLAAVGRTAF TNYLLQTVIA TTVFYGHGLG LFGSVTRVEQ
LGFVLVVWVV QIVLSVLWLR SFRFGPVEWI WRTLTYGERQ PIRNPE