Gene Information |
Locus tag | EcHS_A1335 |
Symbol | narH |
ID | 5593515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1332043 |
End bp | 1333581 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640920492 |
Product | nitrate reductase, beta subunit |
Protein accession | YP_001458053 |
Protein GI | 157160735 |
COG category | [C] Energy production and conversion |
COG ID | [COG1140] Nitrate reductase beta subunit |
TIGRFAM ID | [TIGR01660] nitrate reductase, beta subunit |
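The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS length counts one terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
start_bp = 1332043
end_bp = 1333581


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1539 512
```

Both values agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields of this record.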
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.00571547 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC GTTCACAAGT CGGCATGGTG CTGAATCTCG ATAAGTGCAT CGGCTGCCAC ACCTGTTCAG TTACCTGTAA AAACGTCTGG ACCAGCCGTG AAGGCGTGGA ATACGCGTGG TTCAACAACG TGGAAACCAA GCCGGGTCAG GGCTTCCCGA CTGACTGGGA AAACCAGGAA AAATACAAAG GCGGCTGGAT CCGTAAAATC AATGGCAAAC TGCAGCCGCG CATGGGTAAC CGCGCCATGC TGCTGGGTAA AATCTTCGCT AACCCGCATC TGCCGGGGAT CGACGATTAT TACGAGCCGT TCGATTTTGA TTATCAGAAC CTGCATACCG CGCCGGAAGG CAGCAAATCG CAGCCGATTG CCCGTCCGCG TTCGCTGATT ACCGGGGAAC GGATGGCGAA AATCGAAAAA GGGCCGAACT GGGAAGATGA CCTGGGTGGT GAGTTTGACA AACTGGCGAA AGACAAGAAC TTCGACAACA TCCAGAAGGC GATGTATAGC CAGTTCGAAA ACACCTTTAT GATGTATTTG CCGCGCCTGT GCGAACACTG CCTGAACCCG GCATGTGTGG CGACCTGCCC GAGCGGTGCG ATTTACAAGC GTGAAGAAGA TGGCATCGTC CTGATCGACC AGGATAAATG CCGTGGCTGG CGTATGTGCA TCACCGGATG CCCGTACAAA AAAATCTACT TCAACTGGAA GAGCGGTAAG TCTGAGAAGT GCATCTTCTG CTATCCGCGT ATTGAAGCGG GTCAGCCGAC CGTGTGCTCA GAAACCTGTG TCGGTCGTAT CCGTTATCTT GGCGTGCTGC TGTACGATGC CGACGCGATT GAACGTGCAG CCAGCACCGA GAACGAGAAA GATCTTTACC AGCGTCAGCT GGAGGTGTTC CTCGATCCGA ACGATCCGAA AGTCATCGAG CAGGCGATTA AAGACGGTAT TCCGCTGAGC GTTATTGAAG CCGCACAGCA GTCGCCGGTT TATAAAATGG CAATGGAATG GAAACTGGCG CTGCCGCTGC ATCCAGAATA TCGCACACTG CCGATGGTCT GGTACGTGCC GCCTCTGTCT CCGATTCAGT CTGCAGCAGA CGCGGGTGAG CTGGGTAGCA ACGGCATTCT GCCAGACGTC GAAAGCTTGC GTATTCCGGT ACAGTATCTG GCGAATCTGC TGACCGCCGG TGATACCAAA CCGGTACTGC GCGCACTGAA ACGTATGCTG GCGATGCGTC ATTACAAACG TGTTGAAACC GTTGACGGTA AAGTTGATAC CCGTGCGCTG GAAGAGGTCG GTCTGACCGA AGCCCAGGCA CAGGAGATGT ACCGTTATCT GGCGATTGCT AACTACGAAG ATCGCTTTGT GGTGCCGAGT AGTCATCGTG AACTGGCACG GGAAGCCTTC CCGGAGAAAA ATGGCTGCGG CTTTACCTTT GGTGATGGCT GCCACGGTTC AGATACCAAA TTCAATCTGT TCAACAGCCG TCGTATCGAT GCCATCGATG TGACCAGCAA AACGGAGCCG CATCCATGA
|
Protein sequence | MKIRSQVGMV LNLDKCIGCH TCSVTCKNVW TSREGVEYAW FNNVETKPGQ GFPTDWENQE KYKGGWIRKI NGKLQPRMGN RAMLLGKIFA NPHLPGIDDY YEPFDFDYQN LHTAPEGSKS QPIARPRSLI TGERMAKIEK GPNWEDDLGG EFDKLAKDKN FDNIQKAMYS QFENTFMMYL PRLCEHCLNP ACVATCPSGA IYKREEDGIV LIDQDKCRGW RMCITGCPYK KIYFNWKSGK SEKCIFCYPR IEAGQPTVCS ETCVGRIRYL GVLLYDADAI ERAASTENEK DLYQRQLEVF LDPNDPKVIE QAIKDGIPLS VIEAAQQSPV YKMAMEWKLA LPLHPEYRTL PMVWYVPPLS PIQSAADAGE LGSNGILPDV ESLRIPVQYL ANLLTAGDTK PVLRALKRML AMRHYKRVET VDGKVDTRAL EEVGLTEAQA QEMYRYLAIA NYEDRFVVPS SHRELAREAF PEKNGCGFTF GDGCHGSDTK FNLFNSRRID AIDVTSKTEP HP
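The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names are hypothetical, and only the first three blocks of the gene sequence are used here for brevity, so the printed percentage describes that fragment and not the 54% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAAAATTC GTTCACAAGT CGGCATGGTG"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence string to `clean_sequence` and then to `gc_percent` would give the value to compare against the GC content field.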