Gene ECD_10054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_10054 
Symbol
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp783980 
End bp787378 
Gene Length3399 bp 
Protein Length1132 aa 
Translation table11 
GC content57% 
IMG OID 
Producttail:host specificity protein 
Protein accessionACT42636 
Protein GI253976966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAG GAAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC 
ACGCAGTTGC TGAGTGTGAT CGATGCCATC AGCGAAGGGC CGATTGAAGG TCCGGTGGAT
GGCTTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACACTGAGGG GAATACCAAC
ATATCCGGTG TCACGGTGGT GTTCCGGGCT GGTGAGCAGG AGCAGACTCC GCCGGAGGGA
TTTGAATCCT CCGGCTCCGA GACGGTGCTG GGTACGGAAG TGAAATATGA CACGCCGATC
ACCCGCACCA TTACGTCTGC AAACATCGAC CGTCTGCGCT TTACCTTCGG TGTACAGGCA
CTGGTGGAAA CCACCTCAAA GGGTGACAGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG
ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG CAAAACCACC
TCGCAGTATC TGGCCTCGGT GGTGATGGGT AACCTGCCGC CGCGCCCGTT TAATATCCGG
ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG
TCATACACTG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTG
CAGGTGGACT CGGAGCAGTT CGGCAGCCAG CAGGTGAGCC GTAATTATCA TCTGCGCGGG
CGTATTCTGC AGGTGCCGTC GAACTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG
GACGGAACGT TTAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG
ACCCATCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG
CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG
CCGCGCATCA CCTGTAATGC GTACCTGACC ACACAGCGTA AGGCGTGGGA TGTGCTCAGC
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG
CAGGACCGAC CGTCGGATAA GACGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT
GATGGCGCGC CGTTCCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCATAA TGCCGTTGAG
GTGAACTGGA TTGACCCGAA CAACGGCTGG GAGACGGCGA CAGAGCTTGT TGAAGATACG
CAGGCCATTG CCCGTTACGG TCGTAATGTT ACGAAGATGG ATGCCTTTGG CTGTACCAGC
CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CAGAACTGCT GGAAACGCAG
ACCGTGGATT TCAGCGTCGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTTATTGAA
ATCTGCGATG ATGACTATGC CGGTATCAGC ACCGGTGGTC GTGTGCTGGC GGTGAACAGC
CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCGCGCTG
ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC
GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GACGGTGTTG CTGAATACAG CGTATGGGAG
CTGAAGCTGC CGACGCTGCG CCAGCGACTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC
GACGGCACGT ATGCCATCAC CGCCGTGCAG CATGTGCCGG AAAAAGAGGC CATCGTGGAT
AACGGGGCGC ACTTTGACGG CGAACAGAGT GGCACGGTGA ATGGTGTCAC GCCGCCAGCG
GTGCAGCACC TGACCGCAGA AGTCACTGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA
TGGGACACAC CGAAGGTGGT GAAGGGCGTG AGTTTCCTGC TCCGTCTGAC CGTAACAGCG
GACGACGGCA GTGAGCGGCT GGTCAGCACG GCCCGGACGA CGGAAACCAC ATACCGCTTC
ACGCAACTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGGCAG
CAGGGCGATC CGGCGTCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCACC GTCGAGGATT
GAGCTGACGC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCCGT TTATGACCCG
ACGGTACAGT TTGAGTTCTG GTTCTCGGAA AAGCAGATTG CGGATATCAG ACAGGTTGAA
ACCAGCACGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA
CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC
GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ACCTGGATTT TTTCAAAGGC
AAGATAACCG AATCCCATCT CGGCAAGGAG CTGCTGGAAA AAGTCGAGCT GACGGAGGAT
AACGCCAGCA GACTGGAGGA GTTTTCGAAA GAGTGGAAGG ATGCCAGTGA TAAGTGGAAT
GCCATGTGGG CTGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT
GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAACTGAGCC AGTTTCTGGT TGCCGCCAAT
CGTATCGCAT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC
AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC
GGCGGCAATC CTCCGGCCTT TTCCCTGACA CCGGACGGAA AGCTGACCGC TAAAAATGCG
GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA GTAATGTGAC GATAGCTGAA
AACTGTACGA TAAACGGTAC GCTGAGGGCG GAAAAAATCG TCGGGGACAT TGTAAAGGCG
GCGAGCGCGG CTTTTCCGCG CCAGCGTGAA AGCAGTGTGG ACTGGCCGTC AGGTACCCGT
ACTGTCACCG TGACCGATGA CCATCCTTTT GATCGCCAGA TAGTGGTGCT TCCGCTGACG
TTTCGCGGAA GTAAGCGTAC TGTCAGCGGC AGGACAACGT ATTCGATGTG TTATCTGAAA
GTACTGATGA ACGGTGCGGT GATTTATGAT GGCGCGGCGA ACGAGGCGGT ACAGGTGTTC
TCCCGTATTG TTGACATGCC AGCGGGTCGG GGAAACGTGA TCCTGACGTT CACGCTTACG
TCCACACGGC ATTCGGCAGA TATTCCGCCG TATACGTTTG CCAGCGATGT GCAGGTTATG
GTGATTAAGA AACAGGCGCT GGGCATCAGC GTGGTCTGA
 
Protein sequence
MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPIEGPVD GLKSVLLNST PVLDTEGNTN 
ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA
LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVMG NLPPRPFNIR
MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG
RILQVPSNYN PQTRQYSGIW DGTFKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA
LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDKTWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWIDPNNGW ETATELVEDT
QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE
ICDDDYAGIS TGGRVLAVNS QTRTLTLDRE ITLPSSGTAL ISLVDGSGNP VSVEVQSVTD
GVKVKVSRVP DGVAEYSVWE LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEAIVD
NGAHFDGEQS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFLLRLTVTA
DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSRI
ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE KQIADIRQVE TSTRYLGTAL YWIAASINIK
PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVELTED
NASRLEEFSK EWKDASDKWN AMWAVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN
RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGKLTAKNA
DISGSVNANS GTLSNVTIAE NCTINGTLRA EKIVGDIVKA ASAAFPRQRE SSVDWPSGTR
TVTVTDDHPF DRQIVVLPLT FRGSKRTVSG RTTYSMCYLK VLMNGAVIYD GAANEAVQVF
SRIVDMPAGR GNVILTFTLT STRHSADIPP YTFASDVQVM VIKKQALGIS VV