Gene EcolC_1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1759 
Symbol 
ID6066598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1957782 
End bp1958882 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID641601174 
ProductNapC/NirT cytochrome c domain-containing protein 
Protein accessionYP_001724736 
Protein GI170019782 
COG category[C] Energy production and conversion 
COG ID[COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGC 
GGCGGGTTAT TGCTGGCGCA AAAAGCCTTA CATAAAACGT CGGATACAGC ATTTTGCCTT
TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGTCCA CTTTTCGAAC
CAGAAAGGGA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT
TTATTTGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA
GACAGTGACG ATAAGTTCGA AACTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA
TTAAAAGCAA CTGACTCTGC AACGTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT
GCCTCGCAAA GTGAATCTGC GCAGAAAATG CATAACAAAG CACAAAAGGG CGGCGAAACC
TGTATCGATT GTCATAAAGG CATTGCCCAT TTTCCGCCAG AAATAAAAAT GGATGACAAC
GCGGCGCATG AGCTGGAAAG TCAGACCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT
CCTTTCAAAA CTTCTCGCAT AGGCGAGCTG GCTACCGTGA ATCCTGGTAC CGATCTCACC
GTCGTTGATG CCAGTGGCAA ACAGCCGATC GTTCTGTTGC AGGGTTATCA AATGCAGGGC
AGTGAAAACA CGCTCTACCT GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA
GAAGGTATCA AGGCGCTCAC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA
TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACCG
CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGTTGTCA TGCCCCTATT
GCCGCCGACC ATTACACCGT CAATGCGTGG CCGTCCATTG CCAAAGGAAT GGGGGCACGA
ACCAGCATGA GCGAAAACGA ACTGGACATT TTAACGCGGT ATTTCCAGTA CAACGCCAAA
GATATTACCG AGAAACAGTG A
 
Protein sequence
MRGKKRIGLL FLLIAVVVGG GGLLLAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTVHFSN 
QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDKFETHR QEMAETVWKE
LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKGGET CIDCHKGIAH FPPEIKMDDN
AAHELESQTA TSVTNGAHIY PFKTSRIGEL ATVNPGTDLT VVDASGKQPI VLLQGYQMQG
SENTLYLAAG QRLALATLSE EGIKALTVNG EWQADEYGNQ WRQASLQGAL TDPALADRKP
LWQYAEKLDD TYCAGCHAPI AADHYTVNAW PSIAKGMGAR TSMSENELDI LTRYFQYNAK
DITEKQ