Gene Hlac_2193 details


Gene Information

Locus tag: Hlac_2193
Symbol:
ID: 7401126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2176186
End bp: 2177748
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 67%
IMG OID: 643709263
Product: Chlorite dismutase
Protein accession: YP_002566840
Protein GI: 222480603
COG category: [S] Function unknown
COG ID: [COG3253] Uncharacterized conserved protein
TIGRFAM ID:
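The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2176186
END_BP = 2177748


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1563 520
```

Both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields of the record.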


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.242985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGAGG CACCACAGAC CGAGGAGGGG TGGTTCGCGC TGCACGACTT CCGCTCTATC 
GACTGGGACG CGTGGCGAGA CGCCCCCGAG CGAGAGCGGA AACGGGCGAT CGAGGAGGGC
AAGGCCTTCC TGAAACACCG GGAGCTGGTC GCCGACGCCG ACGAGGGCGA CTCCGGGCTG
TTCTCCGTTC TCGGGCACAA GGCCGACCTG CTGTTCGTCC ACTTCCGCCC CACGCTCGAC
GACCTCTCGT CGATCGAGCG CCGGTTCGAG GACACTGCGC TCGCGAACTT CACCGAGCGG
ACGACCTCCT ACGTCTCCGT CACCGAGGTC TCCGGCTACG TCTCCGACGA GTTCTTCGAG
GACCCCGAAT CGGTCGATAC GGGGCTCAAA CGCTACATCG AGGGGAAAAT GACGCCGGAG
ATCCCCGACG ACGAGTACGT CTGCTTCTAC CCGATGAGCA AGCGCCGCGG CGAGGAGTAC
AACTGGTACG ACCTCTCCTT TGAGGACCGC GCCGACCTCA TGGCCGACCA CGGCGAGGTC
GGCAAGGAGT ACGCCGGCAA GATCAAGCAG GTGATCGCCT CCTCCGTCGG CTTCGACAGC
CACGAGTGGG GCGTTACCCT GTTCGGCTCG GACCCGACCG ACATCAAGGA CATCGTCTAC
GAGATGCGCT TCGACCCCGC CTCCTCGCGA TACGGCGAGT TCGGCGAGTT CTACATCGGT
CGCCGGTTCC CGCCCGAGGA TCTGGGCGCG TACTTCGCGG GCGAGACGGT GCCGACGCCA
GCGGGCGACG GCGATACCGG AGATACCGAG GACGGGCACG GACACGCTCA CGGCGAGGGC
CACGACCATG CCGGTTCCGG CGGCGGATCC GCACACGGCG ACCACCCCCA CGGCGAGGAG
GAGACGAGCG GCGAGGGCGA CCACCCGCAT TCGGGTGAGG AGGGCGGACA CGGCGGCGAG
GATGGTGACG ACCCCTCCGA CGCCGATATC CGCGGCGAGC TCGCGGATCT CAACATCTAC
GCCGGTAAAC CCAGCGGCGA GGACGTGTAC GCGACTGTGC TCTACAGCGA GGCCGACGTC
GACGAGTTGT TCGACGAGGT CGAGGGGCTC CGCGGCAACT TCGACCACTA CGGCACCCAC
GTGAAGACGG CGGTGTACGA GGGTCGCGTG ACCGATCGCG CCGCGGTCGT CTCCATCTGG
GACACGGCCT CCGCCGCCGA GACCGCGGCC GGCTTCCTCT CGGAGCTACC GGAGGTCGTC
GCCCGCGCCG GCGAGGAGTC CGGATTCGGC ACGATGGGGA TGTTCTACAC CGTCAAGCCG
GAACATCAGG AGGACTTCAC CGACACCTTC GACGACGTCG GGGAGATACT CGCGGAGATG
GACGGCCACG TCGAGACCGA CCTGATGATG AACGTCGAAG ACGAGAACGA CATGTTCATC
GCGAGCCAGT GGCACGCCAA GGAGGACGCG ATGGCGTTCT TCGGCTCCGA CGAGTTCCGC
GAGACGGTCC AGTGGGGCCG AGAGGTGCTG GCCGACCGGC CGCGTCACGT CTTCCTGGCG
TAA
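
The gene sequence is printed in groups of 10 bases, 60 per line. A minimal sketch of how such a block could be joined into one contiguous string and its GC content computed is given below. The helper names are hypothetical, and only the first line of the sequence is used as sample input. The result for the full sequence should be compared against the reported 67% rather than assumed to match, since the record's rounding convention is not stated.

```python
def join_groups(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungapped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGGTCGAGG CACCACAGAC CGAGGAGGGG "
        "TGGTTCGCGC TGCACGACTT CCGCTCTATC"
    )
    seq = join_groups(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```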
 
Protein sequence
MVEAPQTEEG WFALHDFRSI DWDAWRDAPE RERKRAIEEG KAFLKHRELV ADADEGDSGL 
FSVLGHKADL LFVHFRPTLD DLSSIERRFE DTALANFTER TTSYVSVTEV SGYVSDEFFE
DPESVDTGLK RYIEGKMTPE IPDDEYVCFY PMSKRRGEEY NWYDLSFEDR ADLMADHGEV
GKEYAGKIKQ VIASSVGFDS HEWGVTLFGS DPTDIKDIVY EMRFDPASSR YGEFGEFYIG
RRFPPEDLGA YFAGETVPTP AGDGDTGDTE DGHGHAHGEG HDHAGSGGGS AHGDHPHGEE
ETSGEGDHPH SGEEGGHGGE DGDDPSDADI RGELADLNIY AGKPSGEDVY ATVLYSEADV
DELFDEVEGL RGNFDHYGTH VKTAVYEGRV TDRAAVVSIW DTASAAETAA GFLSELPEVV
ARAGEESGFG TMGMFYTVKP EHQEDFTDTF DDVGEILAEM DGHVETDLMM NVEDENDMFI
ASQWHAKEDA MAFFGSDEFR ETVQWGREVL ADRPRHVFLA
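
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below builds a codon table from the standard 64-letter amino acid string, which translation table 11 shares with the standard code for internal codons (the two differ only in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). It is an illustrative, dependency-free helper, not the method the database used, and it raises `KeyError` on ambiguous bases such as `N`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI genetic code layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence from the record.
    print(translate("ATGGTCGAGG CACCACAGAC C"))  # MVEAPQT
```

The output matches the first seven residues of the protein sequence above. Applying `translate` to the full gene sequence and comparing the result with the listed protein is a quick consistency check on a copied record.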