Gene Hlac_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1407 
Symbol 
ID7400726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1418679 
End bp1420439 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content68% 
IMG OID643708468 
Productnitrite/sulfite reductase hemoprotein beta-component ferrodoxin domain protein 
Protein accessionYP_002566065 
Protein GI222479828 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA AAAAGGAGGA CTGGAAATCG GAGATGTACG GCGACGAGGT TCGCGAGAAG 
CTACTGGAGT TCGCGGAGAC GGGGTTCGAG TCGATCCCGG AAGACGAGCG GGACGCGTGG
TTCACCCGGT TCAAGTTCTG GGGAGTGTTC CACCAGCGCT CGGGCCAAGA GGGCTACTTC
ATGATGCGGC TGACGAACGC CAACGGGATC TTGGAGCCCG GTCAGCTCCG CGCGATCGCC
GAGGTCGCCC GCGACTACGC GTCCGGCCCG GTGTCGAACC CCGAGTTCGG CGACAGCTGG
ATCGATCTGA CGACCCGCCA GTCCGTCCAG CTCCACTGGA TCAAACTCGA AGACGTCCCG
GAGATCTGGG AGAAGCTCGA ATCGGTCGGC GTGACCACCC GGTCGTCGGG CGGCGACACG
ATGCGGAACA TCTCGGGCAG TCCGGTCGCC GGCAAGGACG CCGACGAGGT GATCGACACG
CTCCCGCTGC TCGAACGGTT CCAAGAGGAG ATCCGCGGCG ACGACGACCT CTGTAACATG
CCCCGGAAGT TCAACATCAG CATCTCCGGG ACGCCCGAGG GCGGCGCGCA GGACGCGATC
AACGACATCG GACTCGAACC CGCGAAAAAA GAGATCGACA GCGGGGAGAC GCTCGGCTTC
AACGTGCGCG TCGGCGGCGG GCTCGGCGGG CGAGAGCCCC GCGTCGCCCG CCCGCTGGAC
GTGTTCGTGA CCCCGGACGA GGCGTACGAC GTCGTCCGCG GGTTCGTCGA ATTGTACCAC
GACCACGGCG ACCGGCAGGT CCGCGCGAAG AACCGCTCGC GCTTCTTCGT CGACGATCAC
GGCACCGAGT GGATCCGCGA CCTGCTCGAC GAGGAGTACG TCGACGCCGA TCTGCGGACC
GCCGGCGAGG ACATCCGCGA CGAGTACACG TACAACGCCG GTCGGACCAC CGAGGACGGG
AAGCAGGACT ACACCGGCGT CCACGAGCAG AGCGACGGAA GGCGGTACGT GGGCCTCAGC
GTCCCAGTCG GTCGGCTGCC GGCCGAGGAA GCGATCGAGC TCGCCGACCT CGCCGACGAG
TACGGCTCCG GCGAGGTCCG GCTCACGCGC CGGCAGAACC CGATCATCGT CGACGTGAAC
TCCGACCGCG TGGACGAGCT GCTCGCGGAG CCGCTGCTCG CGAAACACCG GCCCGAACCG
AACCGGTTCG TCCGCGGGGC GATGGCCTGT ACCGGCACCG AGTTCTGCTC GATAGCGCTC
GTCGAGACCA AGACCCGAAT GGCGACGATG CTCCGCTGGC TCCGCGAGAA CGTCGACCTG
CCGGACGACG TCGACCAGCT CAAGATCCAC TTCTCCGGCT GCACCGCCGA CTGCGGACAG
GCCAACACGG CGGACATCGG CCTGCTCGGC ATGCGAGCGC GGAAGGACGG CGAGATGGTC
GAGGCGGTCG ACATCGGCGT CGGCGGCGGG ATGGGCGAGG AGCCGTCCTT CGTCGACTAC
GTCCAACAGC GGGTGCCGGC CGACGAGGCC CCCGGAGCGA TCCGGAACCT GATCGAGGCG
TTCGCGGAGC GCCGGGCGCC GGGCCAGACG TTCCGCGAGT GGGCGGACGC GACCGACACC
GAGACGCTCG CGGCGCTGTG CGAGCCCGAG GAGACGGACT ACGTCGACCC GTCGCTCACC
GACGCGAAGC AGTCGTGGTA CCCGTTCGCC GAGGAGGGGA CGACGGCGGC GAATCCGGAG
GTGGCCTCCT CGGATGACTG A
 
Protein sequence
MANKKEDWKS EMYGDEVREK LLEFAETGFE SIPEDERDAW FTRFKFWGVF HQRSGQEGYF 
MMRLTNANGI LEPGQLRAIA EVARDYASGP VSNPEFGDSW IDLTTRQSVQ LHWIKLEDVP
EIWEKLESVG VTTRSSGGDT MRNISGSPVA GKDADEVIDT LPLLERFQEE IRGDDDLCNM
PRKFNISISG TPEGGAQDAI NDIGLEPAKK EIDSGETLGF NVRVGGGLGG REPRVARPLD
VFVTPDEAYD VVRGFVELYH DHGDRQVRAK NRSRFFVDDH GTEWIRDLLD EEYVDADLRT
AGEDIRDEYT YNAGRTTEDG KQDYTGVHEQ SDGRRYVGLS VPVGRLPAEE AIELADLADE
YGSGEVRLTR RQNPIIVDVN SDRVDELLAE PLLAKHRPEP NRFVRGAMAC TGTEFCSIAL
VETKTRMATM LRWLRENVDL PDDVDQLKIH FSGCTADCGQ ANTADIGLLG MRARKDGEMV
EAVDIGVGGG MGEEPSFVDY VQQRVPADEA PGAIRNLIEA FAERRAPGQT FREWADATDT
ETLAALCEPE ETDYVDPSLT DAKQSWYPFA EEGTTAANPE VASSDD