Gene EcolC_1128 details


Gene Information

Locus tag: EcolC_1128
Symbol:
ID: 6068006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1230178
End bp: 1233459
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 55%
IMG OID: 641600544
Product: TPR repeat-containing protein
Protein accession: YP_001724122
Protein GI: 170019168
COG category:
COG ID:
TIGRFAM ID:
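
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1230178
end_bp = 1233459
gene_length_bp = 3282
protein_length_aa = 1093


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def coded_residues(gene_length: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coded_residues(gene_length_bp)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the 3282 bp span corresponds to 1094 codons, of which 1093 encode amino acids and one is the stop codon.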


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG 
CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT
TATCCCTATG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG
GTGTGGCTGG AAAACGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGTGTG
CATCGCGCAT GGGATAAAGT GAAACAACGC GATTTTGTTT ATCACAATGA AGTCATTAAA
CCTGCGCTGG TGGGGCTGCT GGGACCGTGG ATCTCCGGTG GGATTGAGTT TAACTGGCCG
CAACACCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGAAGAC
GGCGCACAGA CGGTGTGGGT CGGCGAAACG GAGCCGATGC ATGGTTTACA GGTGATGACA
GGTTTCACCC TGCGCCCTGA CCGGGCGGCG CTGGAAATCG CCAGCCGCGT CTATAACGGC
AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGGGAAGGG
CATCAGAGCG TTTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC
TCCGCTTTCC CCATCGCCAC CGGCACTTAC TACAAAGTGG ACTACTCCGC TGGAGTGGAC
ATTTCTCGCT ATAAAAATGT GCCCGTTCCA ACCTCATATA TGGCTGAAAA ATCACAGTAC
GATTTTGTTG GCGCGTGGTG TCACGATGAA GATGGCGGTT TGCTGCACGT TGCCAACCAC
CATATTGCGC CAGGTAAAAA ACAGTGGAGT TGGGGACACA GTGAATTTGG CCAGGCGTGG
GACAAGAGCC TGACCGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC
GATAACCAGC CTGATTTTAC CTGGCTTGAT GCTTACGAAG AGAAGCGTTT CGAGCAGTAT
TTCCTGCCTT ATCATTCTCT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA
CTCCAGCGTA GTGAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA
TACCGCCTGG CGATCCGCGA AATCGGCAAA TGCAACGCGT TACTTGATGA TGCCGTGGCC
TTGACACCAG CGACCGCCAT CCAGGGCGTG TTGCATGGTA TCAATCCTGA AAGGCTGACC
ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCAGGAACA TCAGCCGCAA
GAGTTGCCGT TGCCGGACGT CGCCAAAGCG CCACTGGCAG CACAAGACAT TACCAGTACA
GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG
TTCGATTACT ACCTGCGCGG TGTGGCGCTG GATCCGCTGG ATTACCGCTG TAACCTGGCG
CTGGCGATGC TGGAATATAA CCGCGCAGAT TTCCCACAAG CGGTGGCGTA TGCCACTCAG
GCGCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC GAGTTTGATT
CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGT
GCGGTCTGGA GCGGCAATAG TAAAGCAGGC GGCTATTATG GTCTGGCACG ACTGGCGGCG
CGTAATGGAA ACTTCGACGC GGGTCTGGAT TTTTGCCAAC ACAGTCTTCG CACCTGCCCA
ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC
AACGCGCGTT TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG
TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCTCTCG CGCAGTGGCG CGGTCTGTGT
CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCC
ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTACCTG
CAAGCCAGCT TGCTGCCGAA AGCCGAACGT GGCGAACTGG TCGCAAAAGC CATTGATGTC
TTCCCGCAGT TTGTCCGTTT CCCGAATACG TTGGAAGAAG TGGCGGCGCT GGAGAGTATT
GAAGAGTGCT GGTTTGCTCG CCATTTACTG GCCTGCTTCT ACTACAACAA ACGTAGCTAC
AACAAAGCCA TTGCCTTATG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC
TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT
TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT
TTGCTTGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT
CAGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC
ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GTAAATTCCA CCCGTGGGAA
GGCGGGGAAG GAAAGGTCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG
CATCTTGATG CCAGACAGCC GCAGCAGGCC AGCGAACTGC TTCATGCCGC GCTGCATTAT
CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG
CAGGCGATAT GCGCCAACGC GCAGGGCGAT GAAACTGAAG CGATGCGTTG TTTACGTCTG
GCGGCGACCG GCGATCGCAC CATTAACATC CACAGTTATT ACAACGATCA GCCGGTTGAT
TATCTCTTCT GGCAAGGAAT GGCGCTGCGA CTGCTGGGTG AACAGCAAAC CGCACAGCAA
CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT TGAGGCGGAT
TTCTTTGCTG TTTCACAACC TGACCTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT
AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAA
TATGAATCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CCTGGCCGAA AGCGGCATTA
TTCACCACCG TGATGCCTTT TATTTTTAAC TACGTTCACT AA
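
The gene sequence is laid out in blocks of 10 bases, so it needs to be joined before any calculation. As a rough sketch of how the GC content field could be recomputed, the helper names below are hypothetical, and only the first line of the sequence is used as sample input; paste the full sequence to check the reported figure.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))
```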
 
Protein sequence
MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA 
VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP
QHHRPTTFMP VDFTLEAHED GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG
NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD
ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW
DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK
LQRSERGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LTPATAIQGV LHGINPERLT
IELSDADGNI VLSYQEHQPQ ELPLPDVAKA PLAAQDITST DEAWFIGQHL EQYHHASRSP
FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI
RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGNFDAGLD FCQHSLRTCP
TNQEVLCLHN LLLVLSGRQD NARLQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC
QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV
FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY NKAIALWQRC VEMSPEFADG
WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN
QEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKVTSQF ILNQLLRAWQ
HLDARQPQQA SELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEAMRCLRL
AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQTAQQ LFSEMKQWAQ EMAKTSIEAD
FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAQ YESARAELTA INPAWPKAAL
FTTVMPFIFN YVH
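
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of NCBI translation table 11, which match the standard code for amino acids and differ from it only in the permitted start codons; since this gene begins with ATG, that difference does not matter here. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring layout whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence above.
print(translate("ATGACTCCAG TAAAAGTGTG G"))  # first seven codons
print(translate("TACGTTCACT AA"))            # last three codons plus stop
```

The two fragments translate to MTPVKVW and YVH, which agree with the first and last residues of the protein sequence listed above.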