Gene Hlac_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1198 
Symbol 
ID7399465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1205156 
End bp1206604 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content65% 
IMG OID643708263 
Producthypothetical protein 
Protein accessionYP_002565862 
Protein GI222479625 
COG category 
COG ID 
TIGRFAM ID[TIGR02537] archaeal flagellin N-terminal-like domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000167019 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000759697 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGCA GCGAGCGCGG CCAAAGCGAG GTGATCGGGG TCGTGCTCCT GTTGGGGATA 
ACGATCGCCG CCGTGACGGT GACGGTGGCG ACCGGAAGCG CGGCGCTCGG GTTGGTCACC
GACGAGGCCC GCTCGGCGAG CGTGGAAAAT GGGATGTCAC AGCTCAGCTC TCAGTCGAGT
CTCGTCGCGC TCGGCGGGAC GGACGCGCGC CGGTTCGACC TCGGGTCCGT CGACGGCGGA
CAGCTCCGCC TCGACGAGGA GGCGGGGCGC GTCGAGGTGC GCATCGAGAA CGGGACCGAC
ACCATCACCA CGTACAACGG CTCGATCGGA ACGCTGGAGT ACGTCGGCGA CCGGCGCAAC
GTCGCGATGC AGGGTGGCGG CGTCTGGGCG ATGGAAGGCG GCCGCGGGCG GATGATCTCA
CCGCCGGAGT ACCACTACCG CGGTGAGACA CTCACCTTCC CGATCGTTCG GCTGATCGGG
GACGAGTCGT CGCCCACGAG CGGAACCGGC ATCGTCCGCC GGACTGCAAA CGATCCCGGT
GCAGTCACCG AAACCGCGAA CCCGCTCCGG AACGGGACCG TCGTCGTCGA GGTCGAAAGC
GAGTACTACG AGGGGTGGTA CGACTTTTTC ACCCGGCGCG CCGACGGCAC CGTAACGAAG
GACGACGCGA ACCAGACGAC GACCGCCCGA CTGGTGGTCC CGGAGGAGGT GAGCTTCGAC
AGGACGCTGG CCGTCAGCGA GGCTGACGGC TACTCCCACT CCGGGAACAA AAATAACGAA
CTGAGTGAGG GCGACTACGT CGAGGGAGAG AGCTTCCCGT CCCCGGGATC GCTGATCGCC
GATCAGATCG CCGCAGCCGC CGACGACAAC GACAACGGCA CGGAGACCTG TGTAACCGCG
AGCGGATTCA ACGGGTGTGG GACGGTCGGA TCCGGGACGG TCGGGTCCGG GGTGTATTAC
TTCGGCGGCG ACGCCGAGGT CATCGGCGAC CTCACGTTCA ACACTACCGA TGGCGACATC
GTCGTCGCTG TCGACGGCGA TTTCGATATC GGAGACAACG ACATCACCGT TGAAGACGGA
CGCAACAACG TCACCTACTA CATCAACGGC TCGCTCGATC TGCAGGGGAG CCCGACCGTC
AGTGTCGACT CCGCGAGCCG GAACGTGTTC TACGTCAACG GGGGGTTCCT CGACGGAAGC
GCGGGGGACG GAAACCCGAC CATCGAGGCG ATCGTCTACG CCCCGAACGC GAACGTCGTG
ACCAACGGAA ACCCGACGCT CAGAGGCGCA TTCGTCACGA AGTCGCTCTC GACCGGCGGG
AACGCGAAGG TCCAGTACGA CGAGAGCCTC AGGGGTCTGG AGATCCGGAT CACCGGCGGA
TCGGGGCAAA ATCCCATCAC CTACCTCCAC GTGAGCGAGA ACGTGGTCGA GGTTGATTTC
GATCGGTGA
 
Protein sequence
MKRSERGQSE VIGVVLLLGI TIAAVTVTVA TGSAALGLVT DEARSASVEN GMSQLSSQSS 
LVALGGTDAR RFDLGSVDGG QLRLDEEAGR VEVRIENGTD TITTYNGSIG TLEYVGDRRN
VAMQGGGVWA MEGGRGRMIS PPEYHYRGET LTFPIVRLIG DESSPTSGTG IVRRTANDPG
AVTETANPLR NGTVVVEVES EYYEGWYDFF TRRADGTVTK DDANQTTTAR LVVPEEVSFD
RTLAVSEADG YSHSGNKNNE LSEGDYVEGE SFPSPGSLIA DQIAAAADDN DNGTETCVTA
SGFNGCGTVG SGTVGSGVYY FGGDAEVIGD LTFNTTDGDI VVAVDGDFDI GDNDITVEDG
RNNVTYYING SLDLQGSPTV SVDSASRNVF YVNGGFLDGS AGDGNPTIEA IVYAPNANVV
TNGNPTLRGA FVTKSLSTGG NAKVQYDESL RGLEIRITGG SGQNPITYLH VSENVVEVDF
DR