Gene EcolC_1444 details

Gene Information

Locus tag: EcolC_1444
Symbol:
ID: 6067475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1593796
End bp: 1596282
Gene Length: 2487 bp
Protein Length: 828 aa
Translation table: 11
GC content: 56%
IMG OID: 641600863
Product: nitrate reductase catalytic subunit
Protein accession: YP_001724434
Protein GI: 170019480
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01706] periplasmic nitrate reductase, large subunit
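The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. The function names are hypothetical and not part of any database API. The sketch assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(1593796, 1596282)
print(length)                     # 2487
print(protein_length_aa(length))  # 828
```

Both results agree with the Gene Length and Protein Length fields of the record.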


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCA GTCGTCGTAG CTTTATGAAA GCTAACGCCG TTGCGGCCGC TGCGGCGGCT 
GCCGGTCTCA GCGTGCCGGG CGTTGCCCGC GCCGTTGTTG GTCAGCAGGA AGCCATTAAA
TGGGATAAAG CGCCGTGCCG TTTCTGCGGT ACTGGTTGCG GCGTTCTGGT CGGAACGCAG
CAGGGGCGTG TGGTGGCCTG TCAGGGCGAC CCGGACGCAC CGGTTAACCG TGGCCTGAAC
TGCATTAAGG GCTATTTCCT GCCCAAGATC ATGTACGGTA AAGACCGTTT GACGCAGCCG
CTGCTGCGTA TGAAAAACGG TAAATATGAC AAAGAAGGCG AATTTACCCC AATCACCTGG
GATCAGGCCT TCGATGTGAT GGAAGAGAAA TTCAAAACCG CCCTGAAAGA AAAAGGACCG
GAATCGATCG GTATGTTCGG TTCTGGTCAG TGGACTATCT GGGAAGGTTA TGCCGCGTCC
AAGCTGTTCA AAGCGGGCTT CCGTTCGAAC AACATCGACC CGAACGCACG TCACTGTATG
GCGTCGGCAG TAGTTGGCTT TATGCGTACC TTTGGTATGG ATGAGCCGAT GGGCTGCTAT
GACGACATCG AGCAGGCTGA CGCGTTTGTG CTGTGGGGCG CTAACATGGC GGAGATGCAC
CCGATCCTCT GGTCACGCAT CACTAACCGT CGTCTCTCTA ACCAGAACGT CACCGTGGCG
GTGCTTTCTA CCTACCAGCA TCGTAGCTTC GAGCTGGCGG ATAACGGCAT CATCTTTACG
CCGCAATCTG ACCTGGTGAT CCTGAACTAC ATCGCCAACT ATATCATTCA AAACAATGCG
ATAAATCAGG ACTTCTTCAG CAAGCACGTT AACCTGCGCA AAGGGGCGAC GGACATCGGC
TACGGTTTAC GTCCGACCCA TCCGCTGGAA AAAGCAGCGA AGAATCCGGG TTCTGACGCC
TCCGAACCGA TGAGCTTTGA AGATTACAAA GCCTTCGTTG CCGAGTATAC GCTGGAAAAA
ACTGCCGAAA TGACCGGCGT GCCGAAAGAC CAGTTAGAAC AACTGGCGCA GCTGTATGCC
GATCCGAACA AGAAAGTCAT CTCCTACTGG ACGATGGGCT TCAACCAGCA TACTCGTGGC
GTGTGGGCCA ACAACCTGGT CTACAACCTG CACCTGCTGA CCGGCAAAAT TTCCCAGCCG
GGTTGCGGTC CGTTCTCCCT GACCGGGCAG CCTTCCGCGT GTGGTACTGC GCGTGAAGTG
GGCACCTTTG CTCACCGTCT GCCTGCGGAC ATGGTGGTGA CTAACGAGAA ACATCGTGAT
ATCTGCGAGA AGAAGTGGAA TATCCCGAGC GGCACCATTC CGGCGAAAAT CGGTCTGCAT
GCGGTAGCAC AAGACCGTGC GCTGAAAGAC GGCAAGCTGA ATGTTTACTG GACCATGTGT
ACCAACAACA TGCAGGCCGG GCCGAACATC AATGAAGAGC GTATGCCGGG CTGGCGCGAT
CCGCGCAACT TCATCATCGT CTCCGATCCG TATCCGACAG TCAGTGCGCT GGCCGCCGAC
TTGATCCTGC CGACCGCAAT GTGGGTAGAG AAAGAGGGCG CTTACGGTAA CGCCGAACGC
CGTACTCAGT TCTGGCGTCA GCAGGTACAG GCACCGGGCG AAGCGAAATC GGATCTCTGG
CAGTTAGTTC AGTTCTCCCG CCGCTTCAAA ACTGAAGAAG TATGGCCGGA AGAGCTGCTG
GCGAAGAAAC CGGAACTGCG TGGCAAAACG CTGTACGAAG TTCTGTATGC CACCCCCGAA
GTGAGCAAAT TCCCGGTATC CGAACTGGCG GAAGATCAGC TGAACGATGA ATCCCGCGAG
CTGGGCTTCT ATCTGCAAAA AGGGCTGTTC GAAGAGTACG CATGGTTTGG TCGCGGTCAC
GGTCACGATC TGGCACCGTT CGATGACTAC CACAAAGCGC GCGGTCTGCG CTGGCCGGTG
GTGAACGGTA AAGAAACGCA GTGGCGTTAC AGCGAAGGTA ACGACCCGTA CGTGAAAGCG
GGCGAAGGCT ACAAGTTCTA CGGTAAACCG GATGGCAAAG CGGTGATCTT CGCGCTGCCG
TTCGAACCGG CGGCGGAAGC ACCGGATGAA GAGTACGACC TGTGGCTCTC TACCGGACGC
GTCCTGGAGC ACTGGCACAC CGGCAGTATG ACTCGCCGTG TGCCGGAACT GCACCGCGCC
TTCCCGGAAG CGGTCCTGTT TATTCACCCG CTGGATGCGA AAGCGCGCGA TCTGCGCCGT
GGCGACAAAG TGAAAGTGGT TTCTCGCCGT GGCGAAGTGA TCTCGATTGT TGAAACGCGC
GGTCGTAACC GTCCGCCACA GGGCCTGGTG TACATGCCGT TCTTCGACGC CGCACAGCTG
GTTAACAAAC TGACGCTGGA TGCGACCGAT CCGCTCTCGA AAGAGACGGA CTTCAAGAAG
TGCGCGGTCA AACTGGAGAA GGTGTAA
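
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names are hypothetical, and the way this page rounds its reported GC content is not stated, so the function returns an unrounded percentage.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = clean_sequence("ATGAAACTCA GTCGTCGTAG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Passing the whole sequence block through `clean_sequence` should give a string whose length matches the Gene Length field of the record.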
 
Protein sequence
MKLSRRSFMK ANAVAAAAAA AGLSVPGVAR AVVGQQEAIK WDKAPCRFCG TGCGVLVGTQ 
QGRVVACQGD PDAPVNRGLN CIKGYFLPKI MYGKDRLTQP LLRMKNGKYD KEGEFTPITW
DQAFDVMEEK FKTALKEKGP ESIGMFGSGQ WTIWEGYAAS KLFKAGFRSN NIDPNARHCM
ASAVVGFMRT FGMDEPMGCY DDIEQADAFV LWGANMAEMH PILWSRITNR RLSNQNVTVA
VLSTYQHRSF ELADNGIIFT PQSDLVILNY IANYIIQNNA INQDFFSKHV NLRKGATDIG
YGLRPTHPLE KAAKNPGSDA SEPMSFEDYK AFVAEYTLEK TAEMTGVPKD QLEQLAQLYA
DPNKKVISYW TMGFNQHTRG VWANNLVYNL HLLTGKISQP GCGPFSLTGQ PSACGTAREV
GTFAHRLPAD MVVTNEKHRD ICEKKWNIPS GTIPAKIGLH AVAQDRALKD GKLNVYWTMC
TNNMQAGPNI NEERMPGWRD PRNFIIVSDP YPTVSALAAD LILPTAMWVE KEGAYGNAER
RTQFWRQQVQ APGEAKSDLW QLVQFSRRFK TEEVWPEELL AKKPELRGKT LYEVLYATPE
VSKFPVSELA EDQLNDESRE LGFYLQKGLF EEYAWFGRGH GHDLAPFDDY HKARGLRWPV
VNGKETQWRY SEGNDPYVKA GEGYKFYGKP DGKAVIFALP FEPAAEAPDE EYDLWLSTGR
VLEHWHTGSM TRRVPELHRA FPEAVLFIHP LDAKARDLRR GDKVKVVSRR GEVISIVETR
GRNRPPQGLV YMPFFDAAQL VNKLTLDATD PLSKETDFKK CAVKLEKV
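
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the method used by the database. It assumes that translation table 11 uses the same amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons. Since this gene starts with ATG, no special handling of alternative start codons is included. The `translate` name and the table layout are illustrative choices.

```python
from itertools import product

# Codons enumerated in TCAG order; amino acids listed in the matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon.

    Whitespace used for display is removed first. Codons containing characters
    other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGAAACTCA GTCGTCGTAG CTTTATGAAA"))  # MKLSRRSFMK
print(translate("TGCGCGGTCA AACTGGAGAA GGTGTAA"))     # CAVKLEKV
```

The two fragments match the first 10 and last 8 residues of the protein sequence listed above.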