Gene Information |
Locus tag | ECD_10049 |
Symbol | H |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 779016 |
End bp | 781577 |
Gene Length | 2562 bp |
Protein Length | 853 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | tail component |
Protein accession | ACT42631 |
Protein GI | 253976961 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
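The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 779016
end_bp = 781577

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2562 853
```

Both results agree with the Gene Length (2562 bp) and Protein Length (853 aa) rows of the record.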
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTGAGTC TGGATGCGGC CAGATTTGAC GAGCAGATGG CCAGAGTCAG GCGTCATTTT TCTGGTACGG AAAGTGATGC GAAAAAAACA GCGGCAGTCG TTGAACAGTC GCTGAGCCGA CAGGCGCTGG CTGCACAGAA AGCGGGGATT TCCGTCGGGC AGTATAAAGC CGCCATGCGT ATGCTGCCTG CACAGTTCAC CGACGTGGCC ACGCAGCTTG CAGGCGGGCA AAGTCCGTGG CTGATCCTGC TGCAACAGGG GGGGCAGGTG AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCAGGGGGC TTGCCGGTGC GATCACCCTG CCGATGGTGG GGGCCACCTC GCTGGCGGTG GCGACCGGTG CGCTGGCGTA TGCCTGGTAT CAGGGCAACT CAACCCTGTC CGATTTCAAC AAAACGCTGG TCCTTTCCGG CAATCAGGCG GGACTGACGG CAGATCGTAT GCTGGTCCTG TCCAGAGCCG GGCAGGCGGC AGGGCTGACG TTTAACCAGA CCAGCGAGTC ACTCAGCGCA CTGGTTAAGG CGGGGGTAAG CGGTGAGGCT CAGATTGCGT CCATCAGCCA GAGTGTGGCG CGTTTCTCCT CTGCATCCGG CGTGGAGGTG GACAAGGTCG CTGAAGCCTT CGGGAAGCTG ACCACAGACC CGACGTCGGG GCTGACGGCG ATGGCTCGCC AGTTCCATAA CGTGTCGGCG GAGCAGATTG CGTATGTTGC TCAGTTGCAG CGTTCCGGCG ATGAAGCCGG GGCATTGCAG GCGGCGAACG AGGCCGCAAC GAAAGGGTTT GATGACCAGA CCCGCCGCCT GAAAGAGAAC ATGGGCACGC TGGAGACCTG GGCAGACAGG ACTGCGCGGG CATTCAAATC CATGTGGGAT GCGGTGCTGG ATATTGGTCG TCCTGATACC GCGCAGGAGA TGCTGATTAA GGCAGAGGCT GCGTATAAGA AAGCAGACGA CATCTGGAAT CTGCGCAAGG ATGATTATTT TGTTAACGAT GAAGCGCGGG CGCGTTACTG GGATGATCGT GAAAAGGCCC GTCTTGCGCT TGAAGCCGCC CGAAAGAAGG CTGAGCAGCA GACTCAACAG GACAAAAATG CGCAGCAGCA GAGCGATACC GAAGCGTCAC GGCTGAAATA TACCGAAGAG GCGCAGAAGG CTTACGAACG GCTGCAGACG CCGCTGGAGA AATATACCGC CCGTCAGGAA GAACTGAACA AGGCACTGAA AGACGGGAAA ATCCTGCAGG CGGATTACAA CACGCTGATG GCGGCGGCGA AAAAGGATTA TGAAGCGACG CTGAAAAAGC CGAAACAGTC CAGCGTGAAG GTGTCTGCGG GCGATCGTCA GGAAGACAGT GCTCATGCTG CCCTGCTGAC GCTTCAGGCA GAACTCCGGA CGCTGGAGAA GCATGCCGGA GCAAATGAGA AAATCAGCCA GCAGCGCCGG GATTTGTGGA AGGCGGAGAG TCAGTTCGCG GTACTGGAGG AGGCGGCGCA ACGTCGCCAG CTGTCTGCAC AGGAGAAATC CCTGCTGGCG CATAAAGATG AGACGCTGGA GTACAAACGC CAGCTGGCTG CACTTGGCGA CAAGGTTACG TATCAGGAGC GCCTGAACGC GCTGGCGCAG CAGGCGGATA AATTCGCACA GCAGCAACGG GCAAAACGGG CCGCCATTGA TGCGAAAAGC CGGGGGCTGA CTGACCGGCA GGCAGAACGG GAAGCCACGG AACAGCGCCT GAAGGAACAG 
TATGGCGATA ATCCGCTGGC GCTGAATAAC GTCATGTCAG AGCAGAAAAA GACCTGGGCG GCTGAAGACC AGCTTCGCGG GAACTGGATG GCAGGCCTGA AGTCCGGCTG GAGTGAGTGG GAAGAGAGCG CCACGGACAG TATGTCGCAG GTAAAAAGTG CAGCCACGCA GACCTTTGAT GGTATTGCAC AGAATATGGC GGCGATGCTG ACCGGCAGTG AGCAGAACTG GCGCAGCTTC ACCCGTTCCG TGCTGTCCAT GATGACAGAA ATTCTGCTTA AGCAGGCAAT GGTGGGGATT GTCGGGAGTA TCGGCAGCGC CATTGGCGGG GCTGTTGGTG GCGGCGCATC CGCGTCAGGC GGTACAGCCA TTCAGGCCGC TGCGGCGAAA TTCCATTTTG CAACCGGAGG ATTTACGGGA ACCGGCGGCA AATATGAGCC AGCGGGGATT GTTCACCGTG GTGAGTTTGT CTTCACGAAG GAGGCAACCA GCCGGATTGG CGTGGGGAAT CTTTACCGGC TGATGCGCGG CTATGCCACC GGCGGTTATG TCGGTACACC GGGCAGCATG GCAGACAGCC GGTCGCAGGC GTCCGGGACG TTTGAGCAGA ATAACCATGT GGTGATTAAC AACGACGGCA CGAACGGGCA GATAGGTCCG GCTGCTCTGA AGGCGGTGTA TGACATGGCC CGCAAGGGTG CCCGTGATGA AATTCAGACA CAGATGCGTG ATGGTGGCCT GTTCTCCGGA GGTGGACGAT GA
|
Protein sequence | MAEPVGDLVV DLSLDAARFD EQMARVRRHF SGTESDAKKT AAVVEQSLSR QALAAQKAGI SVGQYKAAMR MLPAQFTDVA TQLAGGQSPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL PMVGATSLAV ATGALAYAWY QGNSTLSDFN KTLVLSGNQA GLTADRMLVL SRAGQAAGLT FNQTSESLSA LVKAGVSGEA QIASISQSVA RFSSASGVEV DKVAEAFGKL TTDPTSGLTA MARQFHNVSA EQIAYVAQLQ RSGDEAGALQ AANEAATKGF DDQTRRLKEN MGTLETWADR TARAFKSMWD AVLDIGRPDT AQEMLIKAEA AYKKADDIWN LRKDDYFVND EARARYWDDR EKARLALEAA RKKAEQQTQQ DKNAQQQSDT EASRLKYTEE AQKAYERLQT PLEKYTARQE ELNKALKDGK ILQADYNTLM AAAKKDYEAT LKKPKQSSVK VSAGDRQEDS AHAALLTLQA ELRTLEKHAG ANEKISQQRR DLWKAESQFA VLEEAAQRRQ LSAQEKSLLA HKDETLEYKR QLAALGDKVT YQERLNALAQ QADKFAQQQR AKRAAIDAKS RGLTDRQAER EATEQRLKEQ YGDNPLALNN VMSEQKKTWA AEDQLRGNWM AGLKSGWSEW EESATDSMSQ VKSAATQTFD GIAQNMAAML TGSEQNWRSF TRSVLSMMTE ILLKQAMVGI VGSIGSAIGG AVGGGASASG GTAIQAAAAK FHFATGGFTG TGGKYEPAGI VHRGEFVFTK EATSRIGVGN LYRLMRGYAT GGYVGTPGSM ADSRSQASGT FEQNNHVVIN NDGTNGQIGP AALKAVYDMA RKGARDEIQT QMRDGGLFSG GGR
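The relationship between the two sequence fields can be illustrated by translating the gene sequence codon by codon. The sketch below is not the database's own pipeline; it assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code, and the helper names `clean` and `translate` are hypothetical. Only the first 30 and last 12 bases of the gene sequence are used, copied from the record with their 10-base grouping.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
head = "ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT"
tail = "GGTGGACGAT GA"

print(translate(head))  # MAEPVGDLVV
print(translate(tail))  # GGR
```

The two outputs match the first ten residues and the last three residues of the protein sequence listed in the record, and the final codon TGA is a stop codon.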