Gene ECD_02441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02441 
SymbolyphG 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2545269 
End bp2548550 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content55% 
IMG OID 
Producthypothetical protein 
Protein accessionACT44261 
Protein GI253978591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG 
CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT
TATCCCTACG GCGTGACCGA TACGCTGAGT GAACAGAAAA CCCTGAAATC CTGGCAGGCG
GTGTGGCTGG AAAACGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGGGTG
CATCGCGCAT GGGATAAAGT GAAACAGCGC GATTTTGTCT ATCACAATGA AGTCATTAAA
CCTGCGCTGG TGGGGCTGCT GGGACCGTGG ATCTCTGGCG GGATTGAGTT TAACTGGCCG
CAACACCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGAAGAC
GGCGCACAGA CGGTGTGGGT CGGCGAAACG GAGCCGATGC ATGGTTTACA GGTGATGACA
GGTTTCACCC TGCGCCCTGA CCGGGCGGCG CTGGAAATCG CCAGCCGCGT CTATAACGGC
AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGGGAAGGG
CATCAGAGCG TTTTCCCGCC GGATGTGACT GCGGTGTTTG ATCACGGCAA ACGGGCCGTC
TCCGCTTTCC CCATCGCCAC CGGCACTTAC TACAAAGTGG ACTACTCCGC TGGAGTGGAC
ATTTCTCGCT ATAAAAATGT GCCCGTTCCA ACCTCATATA TGGCTGAAAA ATCACAGTAC
GATTTTGTTG GCGCGTGGTG TCACGATGAA GATGGTGGTT TGCTACACGT TGCCAACCAC
CATATTGCGC CAGGTAAAAA GCAGTGGAGC TGGGGACACA GTGAATTTGG CCAGGCGTGG
GATAAGAGTC TGACCGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC
GATAACCAGC CTGATTTTAC CTGGCTTGAT GCTTACGAAG AGAAGCGTTT TGAGCAGTAT
TTCCTGCCTT ATCATTCTTT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA
CTCCAGCGTA GTGAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA
TACCGCCTGG CGATCCGCGA AATCGGCAAA TGCAACGCGT TACTTGATGA TGCCGTGGCA
CTGATGCCTG CGACCGCCAT CCAGGGCGTG TTGCACGGTA TCAATCCTGA AAGGCTGACC
ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCAGGAACA TCAGCCGCAA
GAGTTGCCGT TGCCGGACGT CGCCAAAGCG CCACTGGCAG CACAAGACAT TACCAGTACA
GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG
TTCGATTACT ACCTGCGCGG CGTGGCGCTG GATCCGCTGG ATTACCGCTG TAACCTGGCG
CTGGCGATGC TGGAATATAA CCGTGCCGAT TTCCCGCAAG CGGTGGCGTA TGCCACTCAG
GCTCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC GAGTTTGATT
CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGG
GCGGTCTGGA GCGGCAACAG CAAAGCCGGT GGCTATTATG GCCTGGCACG ACTGGCTGCG
CGTAATGGTA ACTTCGATGC GGGTCTGGAT TTTTGCCAAC AAAGTCTTCG CGCCTGCCCA
ACCAATCAGG AAGTGCTTTG CCTGCATAAC CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC
AACGCGCGTT TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG
TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCCCTCG CGCAGTGGCG CGGTCTGTGT
CAGGGACGCG ACGTTAACGC TCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCC
ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTACCTG
CAAGCCAGCT TGCTGCCGAA AGCCGAACGT GGCGAACTGG TCGCAAAAGC CATTGATGTC
TTCCCGCAGT TTGTCCGTTT CCCGAATACG CTGGAAGAAG TGGCGGCGCT GGAGAGTATT
GAAGAGTGCT GGTTTGCTCG CCATTTACTG GCCTGCTTCT ACTATAACAA ACGTAGCTAC
AACAAAGCCA TTGCCTTATG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC
TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT
TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT
TTGCTTGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT
CAGGAAATTG CGCTGAAACG CGACGATATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC
ACGGGTCAGG CAGACAAAGC TGCGGACATT CTCGCCACGC GTAAATTCCA CCCGTGGGAA
GGCGGGGAAG GGAAGGTCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG
CATCTTGATG CCAGAGAGCC GCAGCAGGCC AGCGAACTGC TTCATGCCGC GCTGCATTAT
CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG
CAGGCGATAT GCGCCAACGC GCAGGGCGAT GAAACTGAAG CGATGCGTTG TTTACGTCTG
GCGGCGACCG GCGATCGCAC CATTAACATC CACAGTTATT ACAACGATCA GCCGGTTGAT
TATCTCTTCT GGCAAGGAAT GGCGCTGCGA CTGCTGGGTG AACAGCAAAC CGCACAGCAA
CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT TGAGGCGGAT
TTCTTTGCTG TTTCACAACC TGACCTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT
AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAA
TATGAATCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CCTGGCCGAA AGCGGCATTA
TTCACCACCG TGATGCCTTT TATTTTTAAC TACGTTCACT AA
 
Protein sequence
MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA 
VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP
QHHRPTTFMP VDFTLEAHED GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG
NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD
ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW
DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK
LQRSERGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LMPATAIQGV LHGINPERLT
IELSDADGNI VLSYQEHQPQ ELPLPDVAKA PLAAQDITST DEAWFIGQHL EQYHHASRSP
FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI
RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGNFDAGLD FCQQSLRACP
TNQEVLCLHN LLLVLSGRQD NARLQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC
QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV
FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY NKAIALWQRC VEMSPEFADG
WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN
QEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKVTSQF ILNQLLRAWQ
HLDAREPQQA SELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEAMRCLRL
AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQTAQQ LFSEMKQWAQ EMAKTSIEAD
FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAQ YESARAELTA INPAWPKAAL
FTTVMPFIFN YVH