Gene ECD_10049 details

Gene Information

Locus tag: ECD_10049
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 779016
End bp: 781577
Gene Length: 2562 bp
Protein Length: 853 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: tail component
Protein accession: ACT42631
Protein GI: 253976961
COG category:
COG ID:
TIGRFAM ID:
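The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function name `check_gene_record` is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = gene_length_bp // 3          # whole codons in the gene
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
result = check_gene_record(779016, 781577, 2562, 853)
print(result)
```

For this record all three checks pass: 781577 - 779016 + 1 = 2562 bp, and 2562 / 3 = 854 codons, one of which is the stop codon.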


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTGAGTC TGGATGCGGC CAGATTTGAC 
GAGCAGATGG CCAGAGTCAG GCGTCATTTT TCTGGTACGG AAAGTGATGC GAAAAAAACA
GCGGCAGTCG TTGAACAGTC GCTGAGCCGA CAGGCGCTGG CTGCACAGAA AGCGGGGATT
TCCGTCGGGC AGTATAAAGC CGCCATGCGT ATGCTGCCTG CACAGTTCAC CGACGTGGCC
ACGCAGCTTG CAGGCGGGCA AAGTCCGTGG CTGATCCTGC TGCAACAGGG GGGGCAGGTG
AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCAGGGGGC TTGCCGGTGC GATCACCCTG
CCGATGGTGG GGGCCACCTC GCTGGCGGTG GCGACCGGTG CGCTGGCGTA TGCCTGGTAT
CAGGGCAACT CAACCCTGTC CGATTTCAAC AAAACGCTGG TCCTTTCCGG CAATCAGGCG
GGACTGACGG CAGATCGTAT GCTGGTCCTG TCCAGAGCCG GGCAGGCGGC AGGGCTGACG
TTTAACCAGA CCAGCGAGTC ACTCAGCGCA CTGGTTAAGG CGGGGGTAAG CGGTGAGGCT
CAGATTGCGT CCATCAGCCA GAGTGTGGCG CGTTTCTCCT CTGCATCCGG CGTGGAGGTG
GACAAGGTCG CTGAAGCCTT CGGGAAGCTG ACCACAGACC CGACGTCGGG GCTGACGGCG
ATGGCTCGCC AGTTCCATAA CGTGTCGGCG GAGCAGATTG CGTATGTTGC TCAGTTGCAG
CGTTCCGGCG ATGAAGCCGG GGCATTGCAG GCGGCGAACG AGGCCGCAAC GAAAGGGTTT
GATGACCAGA CCCGCCGCCT GAAAGAGAAC ATGGGCACGC TGGAGACCTG GGCAGACAGG
ACTGCGCGGG CATTCAAATC CATGTGGGAT GCGGTGCTGG ATATTGGTCG TCCTGATACC
GCGCAGGAGA TGCTGATTAA GGCAGAGGCT GCGTATAAGA AAGCAGACGA CATCTGGAAT
CTGCGCAAGG ATGATTATTT TGTTAACGAT GAAGCGCGGG CGCGTTACTG GGATGATCGT
GAAAAGGCCC GTCTTGCGCT TGAAGCCGCC CGAAAGAAGG CTGAGCAGCA GACTCAACAG
GACAAAAATG CGCAGCAGCA GAGCGATACC GAAGCGTCAC GGCTGAAATA TACCGAAGAG
GCGCAGAAGG CTTACGAACG GCTGCAGACG CCGCTGGAGA AATATACCGC CCGTCAGGAA
GAACTGAACA AGGCACTGAA AGACGGGAAA ATCCTGCAGG CGGATTACAA CACGCTGATG
GCGGCGGCGA AAAAGGATTA TGAAGCGACG CTGAAAAAGC CGAAACAGTC CAGCGTGAAG
GTGTCTGCGG GCGATCGTCA GGAAGACAGT GCTCATGCTG CCCTGCTGAC GCTTCAGGCA
GAACTCCGGA CGCTGGAGAA GCATGCCGGA GCAAATGAGA AAATCAGCCA GCAGCGCCGG
GATTTGTGGA AGGCGGAGAG TCAGTTCGCG GTACTGGAGG AGGCGGCGCA ACGTCGCCAG
CTGTCTGCAC AGGAGAAATC CCTGCTGGCG CATAAAGATG AGACGCTGGA GTACAAACGC
CAGCTGGCTG CACTTGGCGA CAAGGTTACG TATCAGGAGC GCCTGAACGC GCTGGCGCAG
CAGGCGGATA AATTCGCACA GCAGCAACGG GCAAAACGGG CCGCCATTGA TGCGAAAAGC
CGGGGGCTGA CTGACCGGCA GGCAGAACGG GAAGCCACGG AACAGCGCCT GAAGGAACAG
TATGGCGATA ATCCGCTGGC GCTGAATAAC GTCATGTCAG AGCAGAAAAA GACCTGGGCG
GCTGAAGACC AGCTTCGCGG GAACTGGATG GCAGGCCTGA AGTCCGGCTG GAGTGAGTGG
GAAGAGAGCG CCACGGACAG TATGTCGCAG GTAAAAAGTG CAGCCACGCA GACCTTTGAT
GGTATTGCAC AGAATATGGC GGCGATGCTG ACCGGCAGTG AGCAGAACTG GCGCAGCTTC
ACCCGTTCCG TGCTGTCCAT GATGACAGAA ATTCTGCTTA AGCAGGCAAT GGTGGGGATT
GTCGGGAGTA TCGGCAGCGC CATTGGCGGG GCTGTTGGTG GCGGCGCATC CGCGTCAGGC
GGTACAGCCA TTCAGGCCGC TGCGGCGAAA TTCCATTTTG CAACCGGAGG ATTTACGGGA
ACCGGCGGCA AATATGAGCC AGCGGGGATT GTTCACCGTG GTGAGTTTGT CTTCACGAAG
GAGGCAACCA GCCGGATTGG CGTGGGGAAT CTTTACCGGC TGATGCGCGG CTATGCCACC
GGCGGTTATG TCGGTACACC GGGCAGCATG GCAGACAGCC GGTCGCAGGC GTCCGGGACG
TTTGAGCAGA ATAACCATGT GGTGATTAAC AACGACGGCA CGAACGGGCA GATAGGTCCG
GCTGCTCTGA AGGCGGTGTA TGACATGGCC CGCAAGGGTG CCCGTGATGA AATTCAGACA
CAGATGCGTG ATGGTGGCCT GTTCTCCGGA GGTGGACGAT GA
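The gene sequence above is printed in blocks of 10 bases, so the whitespace must be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTGAGTC TGGATGCGGC CAGATTTGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the whole gene sequence through `clean_sequence` first should yield a length equal to the Gene Length field, which is a quick way to confirm that nothing was lost when copying.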
 
Protein sequence
MAEPVGDLVV DLSLDAARFD EQMARVRRHF SGTESDAKKT AAVVEQSLSR QALAAQKAGI 
SVGQYKAAMR MLPAQFTDVA TQLAGGQSPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL
PMVGATSLAV ATGALAYAWY QGNSTLSDFN KTLVLSGNQA GLTADRMLVL SRAGQAAGLT
FNQTSESLSA LVKAGVSGEA QIASISQSVA RFSSASGVEV DKVAEAFGKL TTDPTSGLTA
MARQFHNVSA EQIAYVAQLQ RSGDEAGALQ AANEAATKGF DDQTRRLKEN MGTLETWADR
TARAFKSMWD AVLDIGRPDT AQEMLIKAEA AYKKADDIWN LRKDDYFVND EARARYWDDR
EKARLALEAA RKKAEQQTQQ DKNAQQQSDT EASRLKYTEE AQKAYERLQT PLEKYTARQE
ELNKALKDGK ILQADYNTLM AAAKKDYEAT LKKPKQSSVK VSAGDRQEDS AHAALLTLQA
ELRTLEKHAG ANEKISQQRR DLWKAESQFA VLEEAAQRRQ LSAQEKSLLA HKDETLEYKR
QLAALGDKVT YQERLNALAQ QADKFAQQQR AKRAAIDAKS RGLTDRQAER EATEQRLKEQ
YGDNPLALNN VMSEQKKTWA AEDQLRGNWM AGLKSGWSEW EESATDSMSQ VKSAATQTFD
GIAQNMAAML TGSEQNWRSF TRSVLSMMTE ILLKQAMVGI VGSIGSAIGG AVGGGASASG
GTAIQAAAAK FHFATGGFTG TGGKYEPAGI VHRGEFVFTK EATSRIGVGN LYRLMRGYAT
GGYVGTPGSM ADSRSQASGT FEQNNHVVIN NDGTNGQIGP AALKAVYDMA RKGARDEIQT
QMRDGGLFSG GGR
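The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI base ordering (TCAG). The amino acid assignments of translation table 11 are the same as those of the standard table, and the tables differ only in which codons may act as start codons. The function name `translate` is hypothetical, and only short fragments from the start and end of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTGAAC CGGTAGGCGA T"))  # first 7 codons -> MAEPVGD
print(translate("GGTGGACGAT GA"))            # last 4 codons  -> GGR
```

The two fragments translate to the first seven and last three residues of the protein sequence above, with the terminal TGA read as the stop codon.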