Gene ECD_02159 details


Gene Information

Locus tag: ECD_02159
Symbol: yfaL
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2230778
End bp: 2234530
Gene Length: 3753 bp
Protein Length: 1250 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: adhesin
Protein accession: ACT43982
Protein GI: 253978312
COG category:
COG ID:
TIGRFAM ID:
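
The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the numbers in this record) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Illustrative helpers; the names are assumptions, not part of the source database.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2230778, 2234530)
print(length)                  # 3753
print(protein_length(length))  # 1250
```

Both values agree with the Gene Length and Protein Length fields of the record.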


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGATTA TCTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT 
CTTTTCTCTG CTAACGGTGT CGCGGCGGTC ACTGATTCAT GCCAGGGATA TGATGTCAAA
GCGAGTTGTC AGGCCAGCAG GCAAAGCCTT TCAGGCATTA CGCAGGACTG GAGTATCGCT
GATGGGCAAT GGCTGGTTTT TTCGGATATG ACCAATAACG CCAGCGGTGG GGCCGTATTT
TTGCAACAAG GAGCGGAATT TTCACTATTA CCAGAAAATG AAACTGGAAT GACTCTGTTT
GCCAATAACA CCGTTACAGG AGAATATAAT AACGGCGGGG CCATATTTGC TAAAGAAAAC
TCAACGCTGA ATCTTACTGA TGTTATTTTT TCCGGTAACG TCGCAGGCGG CTATGGTGGC
GCAATCTATT CTTCTGGTAC TAACGATACT GGTGCCGTCG ATTTACGTGT CACTAACGCC
ATGTTTCGCA ATAACATCGC TAATGATGGC AAAGGTGGCG CAATTTATAC CATTAATAAT
GACGTTTATT TAAGTGATGT TATTTTTGAT AACAACCAGG CATATACATC AACAAGTTAC
AGTGATGGCG ATGGCGGGGC AATCGATGTT ACCGATAATA ATAGCGACAG CAAGCATCCT
TCAGGTTATA CGATAGTAAA TAACACTGCC TTTACAAATA ACACTGCCGA AGGTTATGGC
GGGGCGATAT ATACCAATAG CGTGACGGCT CCCTATCTTA TTGATATTTC TGTTGATGAC
AGCTACAGCC AGAACGGAGG CGTGTTAGTC GATGAGAACA ATAGCGCAGC AGGCTATGGA
GATGGTCCTT CCTCTGCGGC GGGTGGCTTT ATGTATCTCG GCTTAAGTGA AGTTACCTTT
GATATTGCCG ACGGAAAAAC GCTGGTTATT GGCAATACAG AGAATGACGG AGCTGTTGAC
TCTATTGCTG GTACCGGGTT AATCACCAAA ACAGGTTCCG GCGATCTGGT ACTTAATGCA
GATAACAATG ACTTTACTGG TGAGATGCAG ATTGAAAACG GTGAAGTTAC CCTGGGCCGC
AGCAACTCCC TGATGAATGT CGGCGATACG CATTGCCAGG ACGATCCGCA AGACTGCTAC
GGTCTGACGA TAGGGAGTAT TGATCAGTAT CAGAATCAGG CTGAGCTAAA CGTTGGCTCG
ACCCAACAAA CTTTTGTGCA CGCATTGACG GGCTTTCAGA ATGGCACTTT AAATATCGAT
GCTGGTGGCA ACGTTACTGT TAATCAGGGC AGTTTTGCTG GCATCATCGA AGGTGCTGGT
CAGCTCACCA TTGCGCAAAA CGGCAGCTAC GTGCTGGCAG GGGCGCAGCC GATGGCGCTA
ACCGGCGATA TAGTCGTTGA TGATGGTGCG GTGCTTTCGC TGGAAGGCGA CGCGGCAGAT
CTTACCGCTC TCCAGGACGA TCCGCAGTCG ATCGTGTTAA ACGGCGGTGT GCTCGATCTC
TCTGATTTCT CCACCTGGCA GAGCGGCACA TCATACAACG ATGGCCTTGA AGTCAGTGGC
AGCAGCGGAA CGGTTATCGG CAGTCAGGAT GTAGTAGATC TTGCAGGTGG CGACAATTTG
CATATCGGCG GCGACGGGAA AGATGGCGTC TACGTGGTGG TCGATGCGAG CGACGGGCAG
GTAAGTCTGG CAAACAATAA TAGTTATTTG GGCACAACAC AAATCGCCTC CGGTACGCTG
ATGGTGAGCG ACAACTCGCA GCTTGGAGAT ACCCACTATA ACCGCCAGGT TATCTTTACC
GATAAGCAAC AAGAAAGCGT GATGGAGATT ACCTCCGACG TTGACACGCG TTCAGATGCG
GCAGGCCACG GACGTGATAT TGAAATGCGC GCCGACGGTG AAGTGGCAGT TGATGCGGGG
GTAGACACGC AGTGGGGCGC ACTGATGGCT GACAGCAGCG GGGAGCATCA GGATGAGGGT
AGCACATTGA CGAAAACGGG GGCGGGTACG CTGGAGCTGA CCGCCAGCGG TACAACGCAG
TCGGCGGTAC GTGTCGAAGA AGGCACCCTG AAAGGTGATG TTGCGGATAT CCTTCCTTAT
GCTTCGTCAC TGTGGGTTGG TGATGGGGCA ACGTTCGTTA CTGGCGCGGA TCAGGATATT
CAGTCAATTG ATACTACTTC CAGCGGCACT ATCGACATCA GCGATGGTAC GGTTTTGCGC
CTGACCGGGC AGGATACTTC CGTCGCCCTT AATGCCTCAC TGTTTAACGG CGATGGGACG
CTGGTGAATG CCACCGATGG TGTGACGTTG ACAGGTGAGC TTAATACCAA CCTTGAAACT
GACAGCCTGA CTTATCTTTC CAACGTGACG GTTAATGGCA ATCTGACCAA TACGTCCGGT
GCGGTTAGCC TGCAAAATGG CGTCGCTGGC GATACGCTGA CGGTAAACGG TGATTATACC
GGCGGCGGTA CGCTACTGCT CGATAGCGAA TTAAACGGCG ATGACTCGGC AAGCGACCAA
CTGGTATTGA ACGGTAATAC TGCTGGCAAT ACCACGGTTG TGATTAATCC CATTACAGGG
ATTGGTGAGC CGATATCTAC AGGCATTAAA GTGGTTGATT TTGCAGCTGA TCCCACTCAG
TTTCAAAACA ATGCGCAGTT CAGTCTGGCA GGCAGCGGCT ACGTCAATAT GGGAGCGTAT
GACTACACGC TGGTGGAAGA TAACAACGAC TGGTATCTGC GATCGCAAGA AGTAACGCCG
CCATCGCCAC CTGATCCAGA CCCGACTCCC GATCCTGATC CCACGCCGGA TCCTGACCCA
ACCCCCGACC CGGAACCTAC GCCTGCTTAC CAGCCGGTGT TGAATGCCAA AGTTGGCGGT
TATCTCAATA ACCTGCGGGC GGCAAATCAG GCGTTTATGA TGGAGCGACG CGATCACGCA
GGTGGCGATG GTCAGACGCT GAATTTACGT GTTATCGGCG GAGATTATCA TTACACAGCA
GCGGGGCAAC TGGCTCAACA TGAAGACACT TCTACGGTGC AGCTTAGCGT CGACCTGTTT
AGGGGGCGCT GGGGCGATGA TGGCGAGTGG ATGCTGGGGA TTGTAGGTGG CTACAGCGAT
AACCAGGGCG ACAGCCGCTC GAGTATGACC GGAACTCGCG CCGATAACCA GAACCACGGT
TATGCGGTTG GGCTGACCTC AAGCTGGTTT CAGCACGGTA AGCAGAAGCA GGGGGCCTGG
CTGGATAACT GGTTGCAGTA CGCGTGGTTT AGCAATGATG TTTCTGAACA TGAAGATGGC
ACAGATCATT ACCATTCGTC GGGGATTATC GCCTTGCTGG AGGCGGGGTA TCAGTGGTTA
CCGGGGCGTG GTGTGGTGAT TGAACCGCAG GCGCAGGTGA TTTATCAGGG CGTGCAGCAG
GATGATTTTA CCGCCGCTAA CCGTGCGCGC GTGTCACAAT CGCAGGGTGA TGATATTCAG
ACGCGGCTGG GTTTACACAG CGAATGGCGT ACGGCTGTTC ATGTCATACC AACATTAGAT
CTGAATTATT ATCACGATCC CCATTCGACG GAAATTGAAG AGGATGGCAG CACTATCAGT
GACGATGCGG TGAAGCAACG GGGTGAAATA AAAGTGGGAG TCACGGGCAA TATCAGTCAG
CGAGTGTCCC TGCGCGGCAG CGTGGCGTGG CAGAAAGGGA GTGATGATTT TGCCCAGACG
GCAGGGTTTT TGTCGATGAC GGTGAAATGG TAA
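
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any computation. As a minimal sketch, the Python below strips the grouping and computes the G+C fraction, which could be used to check the reported GC content of 52% against the full sequence. The helper names are hypothetical, and only the first line of the sequence is used here as an example.

```python
# Hypothetical helpers for working with the grouped sequence block above.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a small example.
first_line = "ATGCGGATTA TCTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the whole block to `clean_sequence` should give a string of 3753 bases if the record is internally consistent.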
 
Protein sequence
MRIIFLRKEY LSLLPSMIAS LFSANGVAAV TDSCQGYDVK ASCQASRQSL SGITQDWSIA 
DGQWLVFSDM TNNASGGAVF LQQGAEFSLL PENETGMTLF ANNTVTGEYN NGGAIFAKEN
STLNLTDVIF SGNVAGGYGG AIYSSGTNDT GAVDLRVTNA MFRNNIANDG KGGAIYTINN
DVYLSDVIFD NNQAYTSTSY SDGDGGAIDV TDNNSDSKHP SGYTIVNNTA FTNNTAEGYG
GAIYTNSVTA PYLIDISVDD SYSQNGGVLV DENNSAAGYG DGPSSAAGGF MYLGLSEVTF
DIADGKTLVI GNTENDGAVD SIAGTGLITK TGSGDLVLNA DNNDFTGEMQ IENGEVTLGR
SNSLMNVGDT HCQDDPQDCY GLTIGSIDQY QNQAELNVGS TQQTFVHALT GFQNGTLNID
AGGNVTVNQG SFAGIIEGAG QLTIAQNGSY VLAGAQPMAL TGDIVVDDGA VLSLEGDAAD
LTALQDDPQS IVLNGGVLDL SDFSTWQSGT SYNDGLEVSG SSGTVIGSQD VVDLAGGDNL
HIGGDGKDGV YVVVDASDGQ VSLANNNSYL GTTQIASGTL MVSDNSQLGD THYNRQVIFT
DKQQESVMEI TSDVDTRSDA AGHGRDIEMR ADGEVAVDAG VDTQWGALMA DSSGEHQDEG
STLTKTGAGT LELTASGTTQ SAVRVEEGTL KGDVADILPY ASSLWVGDGA TFVTGADQDI
QSIDTTSSGT IDISDGTVLR LTGQDTSVAL NASLFNGDGT LVNATDGVTL TGELNTNLET
DSLTYLSNVT VNGNLTNTSG AVSLQNGVAG DTLTVNGDYT GGGTLLLDSE LNGDDSASDQ
LVLNGNTAGN TTVVINPITG IGEPISTGIK VVDFAADPTQ FQNNAQFSLA GSGYVNMGAY
DYTLVEDNND WYLRSQEVTP PSPPDPDPTP DPDPTPDPDP TPDPEPTPAY QPVLNAKVGG
YLNNLRAANQ AFMMERRDHA GGDGQTLNLR VIGGDYHYTA AGQLAQHEDT STVQLSVDLF
RGRWGDDGEW MLGIVGGYSD NQGDSRSSMT GTRADNQNHG YAVGLTSSWF QHGKQKQGAW
LDNWLQYAWF SNDVSEHEDG TDHYHSSGII ALLEAGYQWL PGRGVVIEPQ AQVIYQGVQQ
DDFTAANRAR VSQSQGDDIQ TRLGLHSEWR TAVHVIPTLD LNYYHDPHST EIEEDGSTIS
DDAVKQRGEI KVGVTGNISQ RVSLRGSVAW QKGSDDFAQT AGFLSMTVKW
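
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is an illustration under stated assumptions rather than the database's own method: it builds the codon table by hand (table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), and it does not special-case alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon-to-amino-acid map in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the last line of the gene sequence in this record.
print(translate("ATGCGGATTA TCTTTCTACG CAAGGAGTAT"))         # MRIIFLRKEY
print(translate("GCAGGGTTTT TGTCGATGAC GGTGAAATGG TAA"))     # AGFLSMTVKW
```

The two outputs match the first and last ten residues of the protein sequence listed above.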