Gene Hlac_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1021 
Symbol 
ID7401916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1013044 
End bp1014120 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content70% 
IMG OID643708087 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_002565688 
Protein GI222479451 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.664925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGT CCGACGACGA GTACGATATC GCGGTGGTCG GCGGCGGGCC GGCCGGCCTG 
ACGACTGCCC TATACGGGGC GCGACTGGGC CACGAAACGG TGCTGATCGA CCGCGGCGGC
GGCCGCGCGG CGATGATGGC CGACACGCAC AACGTGATCG GCGTCACCGA GGAGACCTCC
GGCAACGAGT TCCTCGCGAC CGGCCGCGAG CAGGTGCAGT CGTACGGCGG CACGTTCGAG
CGCGGCTTCG TCACCGACGT CGACCGCACC GACGACGACC GATTCCGGCT CTCGACGACC
GGCGCCGAGA TTCTCTCCGA TCGCGTCGTG CTCGCCACCG GCTTCTCCGA CAAGCGGCCG
GATCCGCCGC TCCCGCGGAC GGGCAAGGGG CTCCACTACT GTCTCCACTG TGATGCGTAC
ATGTTCGTCG ACGAGCCGGT GTACGTGATG GGCCACGGCG AGGCGGCCGC CCACGTCGCG
ATGATCATGC TGAACGTGAC CGACGACGTG GATATCCTGA CCCGGGGCGC GGAGCCGACG
TGGAGCGACG AGACCGCCGC ACAGCTCGAC GCACACCCGG TCGAGGTCGT CAGCGAGGAC
GTGACGGGCG TGGAGAACGA CCCCGACTCC GGCTGGCTGG AGGCGCTGGA GTTCGAAGAC
GGCACCCGCC GCGAGTACCG CGGCGGCTTC GCGATGTACG GCTCCGACTA CAACACCGCG
CTCGCCGAGG GGCTCGGCTG CGATCTGACC GAGGGCGGCG AGATCGACGT CGACGACCAC
GGCCGTACCA GCGAGAACGG CGTGTTCGCG GTCGGCGACA TCACCCCCGG CCACAACCAG
GTACCCGTCG CCATGGGGCA GGGCGCGAAA GCCGGCCTCG CGATCCACAA GGATATCCGC
GAGTTCCCGC GCTCGCAGGA GACGATCGAG GCGGACGGCC CCGTCGACGC CGACGAGGTG
CCCGCCATCT CGCCAGCGCT CATGGCGACC GCGGTCGCCC ACGAGGGCCA CGCGGGTGGA
GCGCGGGTGA AAGGCGTCGA GGCCAAAGAG GAGACGCCCG CGGCTGACGA CGACTGA
 
Protein sequence
MSSSDDEYDI AVVGGGPAGL TTALYGARLG HETVLIDRGG GRAAMMADTH NVIGVTEETS 
GNEFLATGRE QVQSYGGTFE RGFVTDVDRT DDDRFRLSTT GAEILSDRVV LATGFSDKRP
DPPLPRTGKG LHYCLHCDAY MFVDEPVYVM GHGEAAAHVA MIMLNVTDDV DILTRGAEPT
WSDETAAQLD AHPVEVVSED VTGVENDPDS GWLEALEFED GTRREYRGGF AMYGSDYNTA
LAEGLGCDLT EGGEIDVDDH GRTSENGVFA VGDITPGHNQ VPVAMGQGAK AGLAIHKDIR
EFPRSQETIE ADGPVDADEV PAISPALMAT AVAHEGHAGG ARVKGVEAKE ETPAADDD