Gene ECD_01844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01844 
SymboltorY 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1902950 
End bp1904050 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID 
ProductTMAO reductase III (TorYZ), cytochrome c-type subunit 
Protein accessionACT43698 
Protein GI253978028 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGC 
GGCGGGTTAT TGCTGGCGCA AAAAGCCTTA CATAAAACGT CGGATACAGC ATTTTGCCTT
TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGTCCA CTTTTCGAAC
CAGAAAGGGA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT
TTATTTGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA
GACAGTGACG ATATGTTCGA AACTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA
TTAAAAGCAA CTGACTCTGC AACGTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT
GCCTCGCAAA GTGAATCTGC GCAGAAAATG CATAACAAAG CACAAAAGGG CGGCGAAACC
TGTATCGATT GTCATAAAGG CATTGCCCAT TTTCCGCCAG AAATAAAAAT GGATGACAAC
GCGGCGCATG AGCTGGAAAG TCAGACCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT
CCTTTCAAAA CTTCTCGCAT AGGCGAGCTG GCTACCGTGA ATCCTGGTAC CGATCTCACC
GTCGTTGATG CCAGTGGCAA ACAGCCGATC GTTCTGTTGC AGGGTTATCA AATGCAGGGC
AGTGAAAACA CGCTCTACCT GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA
GAAGGTATCA AGGCGCTCAC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA
TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACCG
CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGTTGTCA TGCCCCTATT
GCCGCCGACC ATTACACCGT CAATGCGTGG CCGTCCATTG CCAAAGGAAT GGGGGCACGA
ACCAGCATGA GCGAAAACGA ACTGGACATT TTAACGCGGT ATTTCCAGTA CAACGCCAAA
GATATTACCG AGAAACAGTG A
 
Protein sequence
MRGKKRIGLL FLLIAVVVGG GGLLLAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTVHFSN 
QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDMFETHR QEMAETVWKE
LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKGGET CIDCHKGIAH FPPEIKMDDN
AAHELESQTA TSVTNGAHIY PFKTSRIGEL ATVNPGTDLT VVDASGKQPI VLLQGYQMQG
SENTLYLAAG QRLALATLSE EGIKALTVNG EWQADEYGNQ WRQASLQGAL TDPALADRKP
LWQYAEKLDD TYCAGCHAPI AADHYTVNAW PSIAKGMGAR TSMSENELDI LTRYFQYNAK
DITEKQ