Gene SeD_A1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1004 
Symbolhcp 
ID6875070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp994374 
End bp996026 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content57% 
IMG OID642784189 
Producthydroxylamine reductase 
Protein accessionYP_002214864 
Protein GI198245246 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01703] hydroxylamine reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.97662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTGTG TGCAATGTGA ACAAACCATC CGTACACCAG CCGGAAACGG CTGCTCCTAC 
GCGCAGGGAA TGTGCGGTAA AACAGCTGAA ACGTCCGATC TGCAGGATCT GCTGATTGCG
GCTTTGCAAG GTTTGTCTGC CTGGGCGGTG AAGGCCCGTG AATATGGCAT CATTAATCAC
GACGTTGATA ACTTTGCGCC GCGCGCGTTT TTCTCCACGC TGACCAACGT TAACTTCGAC
TCTCCGCGTA TCGTCGGCTA CGCCCGTGAA GCGATTGCCT TGCGTGAAGC GTTGAAAGCG
CAGTGCCTGA GCGTGGATGC CAATGCGCAT TGCGACAATC CGATGGCCGA TCTGCAACTG
GTTAGCGACG ATCTGGGCGA ACTGCAACGT CAGGCGGCGG AATTTACCCC GAATAAAGAC
AAAGCCGCCA TTGGCGAGAA CATCCTCGGC CTGCGTCTGC TGTGCCTGTA CGGTCTGAAA
GGCGCGGCGG CGTATATGGA ACACGCGCAC GTTCTCGGTC AATACGACAA CGACATTTAC
GCGCAGTACC ACAAAATCAT GGCCTGGCTG GGCACCTGGC CTGCTGACAT GAACGCGCTG
CTGGAGTGCG CAATGGAAAT CGGCCAGATG AACTTCAAAG TGATGAGCAT TCTGGATGCC
GGTGAAACCA CCAAATACGG TCACCCAACG CCGACTCAGG TCAACGTCAA AGCGACTGAA
GGCAAGTGCA TTCTGATCTC CGGTCACGAT CTGAAAGATC TCTACAACCT GCTGGAGCAG
ACCGAAGGCA CCGGCGTTAA CGTCTACACT CACGGTGAAA TGCTGCCCGC ACATGGCTAT
CCGGAACTGC GTAAATTCAA GCATCTGGTC GGTAACTACG GCAGCGGCTG GCAGAATCAG
CAGGTCGAAT TCGCCCGCTT CCCTGGCCCA ATCGTGATGA CCTCTAACTG CATTATCGAC
CCGACCGTAG GCAGCTATGA CGACCGTATC TGGACCCGTA GCATCGTCGG CTGGCCGGGC
GTTAGCCATC TTGAAGGCGA TGACTTCGGG CCGGTCATTG CCCAGGCGCA GCAAATGGCG
GGCTTCCCGT ATAGCGAAAT TCCGCATCTC ATCACCGTCG GTTTTGGCCG TCAGACCCTG
CTCGGCACTG CCGATACGCT CATCGATCTG GTCAGCCGTG AAAAACTGCG CCACATCTTC
CTGGTCGGCG GCTGTGACGG CGCGCGCGGC GAGCGTAACT ACTTTACCGA TTTCGCCACC
AGCGTACCGG ATGACTGCCT GATCCTGACC CTGGCGTGCG GTAAATACCG TTTCAACAAA
CTGGAGTTCG GCGACATCGA AGGGCTGCCG CGTCTGGTCG ATGCCGGTCA GTGTAATGAT
GCTTACTCCG CGATTATCCT GGCGGTGACG CTGGCGGAAA AACTGGGCTG CGGCGTCAAC
GATCTGCCGC TATCGCTGGT GCTCTCCTGG TTTGAGCAGA AAGCGATCGT GATTCTGCTG
ACGCTGCTGT CATTAGGCGT GAAAAATATT GTCACCGGGC CGACCGCGCC GGGCTTCTTC
ACGCCGGATC TGCTGGCTAT CCTCAACGAG AAGTTCGGCC TGCGCTCCGT GACCACCGTT
GAAGAAGACA TGAAGCAATT GCTGAGCGCG TAA
 
Protein sequence
MFCVQCEQTI RTPAGNGCSY AQGMCGKTAE TSDLQDLLIA ALQGLSAWAV KAREYGIINH 
DVDNFAPRAF FSTLTNVNFD SPRIVGYARE AIALREALKA QCLSVDANAH CDNPMADLQL
VSDDLGELQR QAAEFTPNKD KAAIGENILG LRLLCLYGLK GAAAYMEHAH VLGQYDNDIY
AQYHKIMAWL GTWPADMNAL LECAMEIGQM NFKVMSILDA GETTKYGHPT PTQVNVKATE
GKCILISGHD LKDLYNLLEQ TEGTGVNVYT HGEMLPAHGY PELRKFKHLV GNYGSGWQNQ
QVEFARFPGP IVMTSNCIID PTVGSYDDRI WTRSIVGWPG VSHLEGDDFG PVIAQAQQMA
GFPYSEIPHL ITVGFGRQTL LGTADTLIDL VSREKLRHIF LVGGCDGARG ERNYFTDFAT
SVPDDCLILT LACGKYRFNK LEFGDIEGLP RLVDAGQCND AYSAIILAVT LAEKLGCGVN
DLPLSLVLSW FEQKAIVILL TLLSLGVKNI VTGPTAPGFF TPDLLAILNE KFGLRSVTTV
EEDMKQLLSA