Gene EcHS_A1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1552 
SymbolnarZ 
ID5591688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1555662 
End bp1559402 
Gene Length3741 bp 
Protein Length1246 aa 
Translation table11 
GC content55% 
IMG OID640920706 
Productnitrate reductase, alpha subunit 
Protein accessionYP_001458262 
Protein GI157160944 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01580] respiratory nitrate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAC TTTTGGATCG CTTTCGCTAC TTCAAACAAA AGGGCGCAAC CTTTGCCGAT 
GGTCACGGAC AGGTGATGCA TAGCAACCGC GACTGGGAGG ACAGCTATCG CCAGCGTTGG
CAGTTCGACA AAATCGTGCG TTCCACCCAC GGTGTTAACT GTACAGGCTC CTGTAGCTGG
AAAATCTACG TTAAAAATGG TCTGGTGACC TGGGAAATCC AACAGACCGA CTACCCGCGC
ACTCGCCCTG ACCTGCCCAA TCATGAACCT CGCGGCTGCC CGCGTGGCGC AAGTTACTCC
TGGTATCTTT ACAGCGCTAA CCGCCTGAAA TACCCGCTCA TTCGTAAACG ACTGATTGAA
CTGTGGCGCG AAGCCCTCAA GCAACACAGC GATCCGGTAC TGGCGTGGGC ATCGATTATG
AACGATCCGC AAAAGTGCCT GAGCTACAAA CAAGTGCGTG GGTGCGGCGG GTTTATCCGC
TCCAACTGGC AGGAACTAAA CCAGCTGATT GCCGCCGCTA ACGTCTGGAC CATCAAAACC
TACGGCCCGG ATCGCGTTGC CGGTTTCTCG CCGATCCCGG CGATGTCGAT GGTTTCTTAC
GCCGCCGGAA CGCGTTATCT GTCGCTGCTT GGCGGCACCT GTTTAAGTTT CTACGACTGG
TATTGCGACC TGCCGCCCGC CTCGCCGATG ACCTGGGGCG AGCAAACCGA CGTACCGGAA
TCTGCCGACT GGTATAACTC CAGCTACATC ATCGCCTGGG GGTCTAACGT ACCGCAGACA
CGTACGCCGG ACGCCCACTT CTTTACCGAA GTACGCTACA AAGGCACTAA AACCATCGCC
ATTACCCCTG ACTACTCTGA AGTGGCCAAA TTGTGCGACC AGTGGCTGGC ACCGAAACAA
GGCACTGATA GCGCCCTGGC GATGGCAATG GGCCATGTGA TTTTAAAAGA GTTTCATCTC
GATAATCCCA GCGACTACTT TATCAACTAC TGCCGCCGCT ACAGCGACAT GCCGATGCTG
GTAATGCTGG AGCCTCGCGA CGATGGTAGC TACGTTCCCG GGCGCATGAT CCGCGCATCT
GACCTGGTGG ATGGACTGGG CGAAAGCAAC AATCCGCAGT GGAAAACCGT AGCAGTTAAT
ACCGCAGGTG AATTGGTAGT GCCGAACGGT TCGATTGGTT TCCGCTGGGG AGAAAAAGGC
AAATGGAATC TGGAATCCAT TGCCGCCGGT ACGGAAACCG AATTGTCGTT AACCCTGCTC
GGTCAACATG ACGCTGTTGC AGGCGTGGCC TTCCCCTACT TTGGCGGCAT TGAAAATCCG
CATTTTCGCA GCGTAAAACA CAATCCGGTG CTGGTGCGCC AATTGCCCGT TAAAAACCTG
ACGTTAGTCG ATGGCAACAC CTGTCCGGTG GTCAGCGTTT ATGATTTGGT ACTGGCGAAT
TACGGCCTCG ATCGCGGGCT GGAAGATGAA AACAGTGCGA AAGATTACGC TGAAATCAAA
CCGTACACCC CAGCCTGGGG TGAGCAAATT ACCGGCGTGC CGCGCCAGTA TATTGAAACC
ATCGCTCGTG AATTTGCCGA TACTGCCCAT AAAACGCATG GGCGCTCAAT GATTATCCTC
GGTGCTGGCG TCAACCACTG GTATCACATG GATATGAACT ACCGTGGGAT GATCAATATG
CTGATCTTCT GCGGTTGTGT CGGGCAAAGC GGCGGCGGCT GGGCGCATTA TGTCGGTCAG
GAAAAACTGC GCCCACAAAC CGGCTGGTTG CCGCTGGCCT TTGCGCTCGA CTGGAACCGC
CCACCGCGCC AGATGAACAG TACCTCGTTT TTCTACAATC ATTCCAGCCA ATGGCGCTAT
GAAAAAGTCT CTGCGCAGGA GTTACTTTCA CCGCTCGCCG ATGCCAGTAA GTACAGCGGT
CATCTGATTG ATTTCAACGT TCGCGCCGAA CGTATGGGCT GGCTGCCCTC TGCGCCGCAG
CTGGGGCGTA ACCCGCTCGG GATTAAAGCT GAAGCCGACA AAGCAGGATT ATCCCCCACA
GAATTTACCG CCCAGGCGCT GAAATCGGGC GATTTACGTA TGGCCTGCGA ACAACCAGAT
AGCGGCAGCA ATCATCCGCG TAATTTGTTT GTCTGGCGTT CTAACCTGCT TGGCTCCTCC
GGCAAAGGCC ACGAGTATAT GCAGAAGTAT CTGCTGGGGA CCGAAAGCGG GATTCAGGGC
GAGGAACTCG GTGCCAGCGA CGGGATCAAA CCGGAAGAAG TCGAGTGGCA AACTGCGGCG
ATTGAAGGCA AGCTCGACCT GCTGGTGACG CTCGACTTCC GCATGTCCAG CACCTGCCTG
TTCTCCGATA TTGTTCTGCC TACCGCCACC TGGTACGAAA AAGACGATAT GAATACCTCG
GATATGCATC CGTTTATTCA TCCACTTTCT GCGGCAGTCG ATCCGGCCTG GGAGTCACGC
AGCGACTGGG AAATCTACAA AGGTATTGCC AAAGCATTTT CGCAAGTGTG CGTGGGTCAT
CTTGGCAAAG AAACCGACGT GGTATTACAA CCACTGCTGC ACGACTCTCC GGCAGAGCTC
TCACAGCCGT GTGAAGTCCT CGACTGGCGC AAAGGCGAAT GCGATCTGAT TCCAGGTAAA
ACCGCACCCA ATATTGTGGC GGTGGAGCGC GACTACCCTG CTACCTATGA ACGCTTTACC
TCGCTCGGGC CATTGATGGA CAAACTTGGC AACGGCGGTA AAGGGATTTC GTGGAATACG
CAGGATGAAA TCGATTTCCT CGGTAAACTC AATTACACCA AGCGTGATGG CCCAGCGCAG
GGGCGTCCGC TGATTGACAC CGCCATTGAC GCTTCAGAAG TGATTCTGGC ACTGGCACCA
GAAACAAACG GTCATGTTGC AGTTAAAGCG TGGCAGGCGC TGGGCGAGAT CACCGGACGC
GAACATACCC ATCTGGCGCT GCACAAAGAG GACGAGAAGA TTCGCTTTCG CGATATTCAG
GCGCAGCCAC GTAAAATTAT CTCCAGCCCC ACATGGTCTG GTCTGGAAAG CGATCACGTC
TCCTATAACG CGGGATACAC CAACGTTCAT GAGTTAATTC CGTGGCGCAC GCTGTCGGGA
CGCCAGCAGC TCTATCAGGA TCATCCGTGG ATGCGTGCTT TTGGTGAAAG CCTGGTGGCT
TATCGCCCGC CTATCGACAC CCGTAGCGTC AGTGAGATGC GCCAGATACC GCCAAACGAC
TTCCCGGAAA AAGCACTTAA CTTCCTGACG CCGCACCAGA AATGGGGCAT TCACTCAACC
TACAGTGAAA ACCTGCTAAT GCTGACGCTC TCTCGCGGTG GACCGATTGT CTGGATCAGC
GAAACCGATG CCCGAGAACT GACCATTGTC GATAACGACT GGGTGGAAGT GTTTAACGCC
AATGGCGCGC TGACGGCCCG CGCGGTGGTC AGCCAACGTG TACCGCCGGG TATGACCATG
ATGTATCACG CTCAGGAACG CATTATGAAT ATTCCTGGTT CGGAAGTAAC TGGCATGCGC
GGCGGTATTC ATAACTCGGT TACCCGCGTT TGCCCGAAAC CAACGCATAT GATTGGCGGT
TACGCGCAGC TGGCCTGGGG CTTTAACTAC TACGGCACCG TCGGATCGAA CCGCGATGAG
TTCATCATGA TCCGCAAGAT GAAGAACGTT AACTGGCTGG ATGATGAAGA TCGCGATCAG
GTACAGGAGG CGAAAAAATG A
 
Protein sequence
MSKLLDRFRY FKQKGATFAD GHGQVMHSNR DWEDSYRQRW QFDKIVRSTH GVNCTGSCSW 
KIYVKNGLVT WEIQQTDYPR TRPDLPNHEP RGCPRGASYS WYLYSANRLK YPLIRKRLIE
LWREALKQHS DPVLAWASIM NDPQKCLSYK QVRGCGGFIR SNWQELNQLI AAANVWTIKT
YGPDRVAGFS PIPAMSMVSY AAGTRYLSLL GGTCLSFYDW YCDLPPASPM TWGEQTDVPE
SADWYNSSYI IAWGSNVPQT RTPDAHFFTE VRYKGTKTIA ITPDYSEVAK LCDQWLAPKQ
GTDSALAMAM GHVILKEFHL DNPSDYFINY CRRYSDMPML VMLEPRDDGS YVPGRMIRAS
DLVDGLGESN NPQWKTVAVN TAGELVVPNG SIGFRWGEKG KWNLESIAAG TETELSLTLL
GQHDAVAGVA FPYFGGIENP HFRSVKHNPV LVRQLPVKNL TLVDGNTCPV VSVYDLVLAN
YGLDRGLEDE NSAKDYAEIK PYTPAWGEQI TGVPRQYIET IAREFADTAH KTHGRSMIIL
GAGVNHWYHM DMNYRGMINM LIFCGCVGQS GGGWAHYVGQ EKLRPQTGWL PLAFALDWNR
PPRQMNSTSF FYNHSSQWRY EKVSAQELLS PLADASKYSG HLIDFNVRAE RMGWLPSAPQ
LGRNPLGIKA EADKAGLSPT EFTAQALKSG DLRMACEQPD SGSNHPRNLF VWRSNLLGSS
GKGHEYMQKY LLGTESGIQG EELGASDGIK PEEVEWQTAA IEGKLDLLVT LDFRMSSTCL
FSDIVLPTAT WYEKDDMNTS DMHPFIHPLS AAVDPAWESR SDWEIYKGIA KAFSQVCVGH
LGKETDVVLQ PLLHDSPAEL SQPCEVLDWR KGECDLIPGK TAPNIVAVER DYPATYERFT
SLGPLMDKLG NGGKGISWNT QDEIDFLGKL NYTKRDGPAQ GRPLIDTAID ASEVILALAP
ETNGHVAVKA WQALGEITGR EHTHLALHKE DEKIRFRDIQ AQPRKIISSP TWSGLESDHV
SYNAGYTNVH ELIPWRTLSG RQQLYQDHPW MRAFGESLVA YRPPIDTRSV SEMRQIPPND
FPEKALNFLT PHQKWGIHST YSENLLMLTL SRGGPIVWIS ETDARELTIV DNDWVEVFNA
NGALTARAVV SQRVPPGMTM MYHAQERIMN IPGSEVTGMR GGIHNSVTRV CPKPTHMIGG
YAQLAWGFNY YGTVGSNRDE FIMIRKMKNV NWLDDEDRDQ VQEAKK