Gene ECD_04072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04072 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4334549 
End bp4336621 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content52% 
IMG OID 
Productputative oxidoreductase 
Protein accessionACT45861 
Protein GI253980191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGC TTGATGACAC CATCCTTGAT GCGCTGACGC ACGTTACTTT CCCAAAGGGT 
TTTGCACAGG CAGAGCCCGC ATGGGTTGTC ACGGTAGACG GTGTTGATTA CCCACTCTGG
CAAACAGATG CCCTGGTAGT CGGCAGTGGT GCAGCCGGGC TGCGTGCAGC TGTGGAACTT
AAACGCCGCC AGCAAAATGT GCTGATCGCC ACTGCCGGGT TATATATGGG GACGTCGGCA
TGTTCTGGTT CCGATAAACA GACACTGTTT ACCGCCGCTA CCGCGGGCAA CGGCGACAAC
TTCACCAAAC TGGCAGAGGC ACTGGCGAGC GGTGGGGCGA TGGATCACGA CACCGCTTAT
GTCGAAGCGG TAGGTTCTCT GCACACTCTT GGCGGGTTGC AATATCTTGG TCTGGAATTA
CCGGAAGATC GCTATGGCGC GATTCTTCGT TATCAAACCG ACCATGACGA AGCCGGGCGT
GCAACCTCGT GTGGGCCGCG GACCTCAAGG TTAATGGTGA AAGTGCTGTT GGAAGAAGTA
CAGCGCCTCG CCATTCCAGT GTTGACCAGT GCAACAGTGA TTAAACTGCT GCATCAGCGT
GACGAAAACG GCGAAGACCG TGTGGCGGGG GCAATCCTCG CGACCGGTCA TCGCGCCCAT
AACCCTTGGG GGCTGGCAAT TGTGACTGCG CCCAATGTGG TACTGGCAAC AGGAGGGCCT
GGCGAGCTTT ATCGCGACAG TGTGTACCCA CACAAATGTT TTGGCTCGCT GGGGCTGGCG
CTGGAGGAAG GCCTGACGCT AACCAATCTG ACCGAAAGCC AGTTTGGTAT AGGCACGCCG
CGCAGCACGT TTCCGTGGAA TTTATCCGGC ACCTATGTAC AGGTGATCCC GTATATCTAT
TCCGTGGATG CTGAGGGTAA CGAGTATAAC TTCCTCGCGG ATTACTATCG CACCACCCAG
GAGCTGGCTT CAAACATTTT CCGTAAAGGC TATCAGTGGC CGTTCCACGC CACTCGGGTG
ATGGATTTTG GCTCCAGCTT GTTAGATATG GCAGTAGCGC AAGAGCAGCA ATCAGGGCGT
CAGGTATTTA TGGATTTCAA TCGCAATCCT GAACCTGTGC CGGGTGACCT GCCATTCTCA
TTAGAGCGAC TGGACGACGA CGTTCGCGCG TATCTGGAAA ATAACGACGC TCTGGCACCA
TCGCCCATCG AACGACTGCA ACGAATGAAT CCGCTGTCTA TCTCGCTGTA TAAGATGCAC
GGTTACGATC TCACCACGCA GCCATTGCAG TTTGCCATGA ATAATCAGCA TATGAATGGC
GGCATTGAAG TGGATATCTG GGGACAAACA TCCCTGCCCG GTTGTTTTGC CGTGGGGGAA
GTTGCTGGCA CACACGGCGT CACTCGCCCT GGTGGTGCGG CATTGAATGC CGGGCAGGTT
TTTGCTGTTC GTCTGGCACG TTTTATTGGT TGCACGCAAA AACGTAATAT TGATGGAGAT
ATAGCACAGC TGGTAGCTCA GGCACTGGCT TCTATAAGAG AGATCATTAC TCAGGCGCAC
GATAACGGGA CCGGGATGCC GTTGTCGGTT GTGAGAGAAA AAATTCAGGC ACGAATGTCT
GACCATGCGG GATTTATTTG CCATGCCGAT AAAGTCCGAC GCGCCACTCG TGATGCCCTG
CTATTGAACG AATTTGTCCA ACGGCATGGA TTGGCTATCA AACATGTGGG CGAAGTTGCC
GAGCTGTTTA TGTGGCGGCA TATGGCGCTG ACCTCTGCCG CCGTCTTAAC TCAACTGACA
CATTATATTG ATGCTGGTGG TGGCAGTCGT GGGGCACGGA TAGTTATTGA TCCACAAGGC
AAATGCCTAC CACAAACTCG TCGCGGCGCA AAAGAAGAAT GGCGTTTTCG CTCTGAACGT
GCTGAAGACA AAAATCACAG ATTAACGATT CAATATTCGC AAGGTTCTTT TATTACCGAA
GTGAAGTCTT TACGTATGCA ACCGTGTATT AACGGTATTT ACTTTGAAAA AAACTGGCCA
GACTTTTTAA ATGGAGAAAT TTACACACAA TAA
 
Protein sequence
MSQLDDTILD ALTHVTFPKG FAQAEPAWVV TVDGVDYPLW QTDALVVGSG AAGLRAAVEL 
KRRQQNVLIA TAGLYMGTSA CSGSDKQTLF TAATAGNGDN FTKLAEALAS GGAMDHDTAY
VEAVGSLHTL GGLQYLGLEL PEDRYGAILR YQTDHDEAGR ATSCGPRTSR LMVKVLLEEV
QRLAIPVLTS ATVIKLLHQR DENGEDRVAG AILATGHRAH NPWGLAIVTA PNVVLATGGP
GELYRDSVYP HKCFGSLGLA LEEGLTLTNL TESQFGIGTP RSTFPWNLSG TYVQVIPYIY
SVDAEGNEYN FLADYYRTTQ ELASNIFRKG YQWPFHATRV MDFGSSLLDM AVAQEQQSGR
QVFMDFNRNP EPVPGDLPFS LERLDDDVRA YLENNDALAP SPIERLQRMN PLSISLYKMH
GYDLTTQPLQ FAMNNQHMNG GIEVDIWGQT SLPGCFAVGE VAGTHGVTRP GGAALNAGQV
FAVRLARFIG CTQKRNIDGD IAQLVAQALA SIREIITQAH DNGTGMPLSV VREKIQARMS
DHAGFICHAD KVRRATRDAL LLNEFVQRHG LAIKHVGEVA ELFMWRHMAL TSAAVLTQLT
HYIDAGGGSR GARIVIDPQG KCLPQTRRGA KEEWRFRSER AEDKNHRLTI QYSQGSFITE
VKSLRMQPCI NGIYFEKNWP DFLNGEIYTQ