Gene EcHS_A1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1968 
SymboltorY 
ID5593004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1976203 
End bp1977303 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID640921113 
Producttrimethylamine N-oxide reductase III, c-type cytochrome subunit TorY 
Protein accessionYP_001458662 
Protein GI157161344 
COG category[C] Energy production and conversion 
COG ID[COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGC 
GGCGGGTTAT TGCTGGCGCA AAAAGCCTTA CATAAAACGT CGGATACAGC ATTTTGCCTT
TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGTCCA CTTTTCGAAC
CAGAAAGGGA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT
TTATTTGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA
GACAGTGACG ATAAGTTCGA AACTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA
TTAAAAGCAA CTGACTCTGC AACGTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT
GCCTCGCAAA GTGAATCTGC GCAGAAAATG CATAACAAAG CACAAAAGGG CGGCGAAACC
TGTATCGATT GTCATAAAGG CATTGCCCAT TTTCCGCCAG AAATAAAAAT GGATGACAAC
GCGGCGCATG AGCTGGAAAG TCAGACCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT
CCTTTCAAAA CTTCTCGCAT AGGCGAGCTG GCTACCGTGA ATCCTGGTAC CGATCTCACC
GTCGTTGATG CCAGTGGCAA ACAGCCGATC GTTCTGTTGC AGGGTTATCA AATGCAGGGC
AGTGAAAACA CGCTCTACCT GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA
GAAGGTATCA AGGCGCTCAC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA
TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACCG
CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGTTGTCA TGCCCCTATT
GCCGCCGACC ATTACACCGT CAATGCGTGG CCGTCCATTG CCAAAGGAAT GGGGGCACGA
ACCAGCATGA GCGAAAACGA ACTGGACATT TTAACGCGGT ATTTCCAGTA CAACGCCAAA
GATATTACCG AGAAACAGTG A
 
Protein sequence
MRGKKRIGLL FLLIAVVVGG GGLLLAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTVHFSN 
QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDKFETHR QEMAETVWKE
LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKGGET CIDCHKGIAH FPPEIKMDDN
AAHELESQTA TSVTNGAHIY PFKTSRIGEL ATVNPGTDLT VVDASGKQPI VLLQGYQMQG
SENTLYLAAG QRLALATLSE EGIKALTVNG EWQADEYGNQ WRQASLQGAL TDPALADRKP
LWQYAEKLDD TYCAGCHAPI AADHYTVNAW PSIAKGMGAR TSMSENELDI LTRYFQYNAK
DITEKQ