Gene Hlac_2119 details

Gene Information

Locus tag: Hlac_2119
Symbol:
ID: 7400639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2109336
End bp: 2110418
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 55%
IMG OID: 643709189
Product: hypothetical protein
Protein accession: YP_002566766
Protein GI: 222480529
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
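
The start, end, gene length, and protein length fields above agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2109336
end_bp = 2110418

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1083 0 360
```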


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.114431
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAAT TATCTGCAGA AAAAAGTGAT AAAAAGGTAA CATCAATATC GCGCCGATCG 
TGGCTCAAAA CACTCGGAGT CGGCGCCTCT CTAATCAGCT TAGAGTCAGG GAGTGTTGCA
GCGACGTCAG GCGGATACGG TATCGGTGGA TATGGTGCGA GTGAATACGG TGACTCGGAT
ACTGGAGTAA CGGTCACGAC CGACGGAGCG AGCGACGTCG GCGAAACGAA TGTGACGCTC
AACGGGTCAC TGACTGATCT GGGCGGCACC TCCTTCGTGG ACGTTTACTT TGAGTATCGG
CACACCAACG TTACCACTTG GAGTGCCACC GCTACGCAAA CCGCCTCGTC AGCTGGTGGT
TTCAGTGCCG CTATTACGGG TCTCGGAGAT GGCGTTGCTT ACGAATTCAG GGCCGTTGCG
TTGACGAGCG ATGGGAATTC GGTTACCGGG TCACCGAGTA ACTTCACTAC CACCGAACAC
TCCGTGGTCG TTTCAACGGA TGGTGCGACC GCTATCGGTG AAACGACTGC GACTCTCAAC
GGCTCTGTGA CGGACCGCGG TAACGCAAAT TCAGCTGATA TCTACTTTGA GTACCGCGAA
GCCGGGAGTA GCAGTTGGAA CGCGACAAGC ACACAGACAC TTACCTCAGC GGAAAGTTTC
ACACAGAATC TGAACAACCT AAAGAGTGGT ACGGACCACG AGTTCAGAGC AGTCGCACTG
GCTAGCGACG GTGACACTGA TACTGGAGGC TCGGTCACAT TTGTGACGGT GACCGCCGAG
AGCGATCCAG CTGTCGGTAC GTTCAGTATT TCAGAGGCTG GCTCGCCGAA TCCACACGCA
GAAATCAACG TTGACTGGGC TGTTTTCGAC GTGGACGGCG ACCTCAGTCT GGTCACCGTC
TCAGTTGCTG ATTCAACTGG GGCAACTGTG AAATCCAGCA CGACGTCCGT TAGTGGTAGC
AGCGTCTCAG GATCTGATTC GCTCAAAATC AAACACGGGG GCGGCGAGGT TTACGAGGTC
ACGCTCCGTG TAGAGGACAA CGCCGGCAAC GTTGTCACTG AAACGGGGTC TGTGTCATCC
TGA
 
Protein sequence
MEKLSAEKSD KKVTSISRRS WLKTLGVGAS LISLESGSVA ATSGGYGIGG YGASEYGDSD 
TGVTVTTDGA SDVGETNVTL NGSLTDLGGT SFVDVYFEYR HTNVTTWSAT ATQTASSAGG
FSAAITGLGD GVAYEFRAVA LTSDGNSVTG SPSNFTTTEH SVVVSTDGAT AIGETTATLN
GSVTDRGNAN SADIYFEYRE AGSSSWNATS TQTLTSAESF TQNLNNLKSG TDHEFRAVAL
ASDGDTDTGG SVTFVTVTAE SDPAVGTFSI SEAGSPNPHA EINVDWAVFD VDGDLSLVTV
SVADSTGATV KSSTTSVSGS SVSGSDSLKI KHGGGEVYEV TLRVEDNAGN VVTETGSVSS
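
The protein sequence can be reproduced from the gene sequence by removing the display spacing and translating codon by codon. The sketch below is a minimal illustration under stated assumptions, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard assignment, which translation table 11 shares for internal codons, and the gene here starts with ATG, so no alternative start codon handling is needed. Only the first 30 bases of the gene are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to display sequences in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
fragment = "ATGGAAAAAT TATCTGCAGA AAAAAGTGAT"
print(translate(fragment))  # prints: MEKLSAEKSD
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and GC content.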