Gene Information |
Locus tag | PC1_2177 |
Symbol | |
ID | 8133121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | - |
Start bp | 2499259 |
End bp | 2504451 |
Gene Length | 5193 bp |
Protein Length | 1730 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644865465 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_003017752 |
Protein GI | 253688562 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies); [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG TAAAAACCAC ACAGCGCCTG CTGGCGTACA CGCTGATCCA CCTGATTGCG TTTCAGCCGC TGCTGCCGGC CATGGCGGCG GGCGTACAGG TCGCGACGGG CAACACCGCG CTGGATCAGG CCGGTAACGG CGTCCCGGTC ATCAATATCG CCACGCCAAA CAGCGCGGGA ATATCCCACA ACCAGTACCA GGATTTTAAC GTTGATAAAC CCGGCCTGAT CCTGAATAAC GGCACGGCGC AGCTCAATCC CACCCAGCTT GGCGGGCTGA TCCAGAACAA CCCTAACCTG AAAGGCAAGG CCGCCGATGC CATTATTAAC GAAGTGGTGT CGACCAACCG CAGTACGCTG GCGGGGTATC TGGAAGTGGG GGGCAAACAG GCCAGCGTCA TCGTGGCCAA CCCGAACGGC ATTACCTGCG ACGGCTGCGG TTTTATCAAT ACCCCGCAGG TGACGCTGAC TACCGGCAAA CCCCAGCTGG ATGCGCAGGG GAACCTGCAG CATATCGACG TGACCCGCGG TGATATCACG CTGACCGGGC AGGGGCTGGA TGCAAGTAAA AGTGACTACC TCAGCCTGAT TGCCCGTACC GCGCAGATTA ACGCCGGGCT GAATGCCAAT GACACGCAGA TTGTGCTGGG GGCTAATCAG GTTGATGCAA CAGGCAAGGT GACGGCGCAG GCAGCGGACA GCGGTGTGAA GGTTGCACTG GATACGGGCG CGCTGGGTGG CATGTACACC AACCGCATCA GGCTGGTTTC CAGCGATAAG GGCGTGGGCG TCAACGTCGG CAACCTGAGT GCGCGTAGCG GGGATATCAC GCTGTCGGCC AACGGCAAGC TCAGCCTCGG TGATACGGTG GCGCAGGGGA ATATTCAGGC TGACGCGGAT GCGCTGGCGC TGCGGGGCAA GCAGCAGGCG GGTGACGCAC TGATGCTGAG CGCGAAGCAG GACATCACGC TGCAGGACGC CACGCTGCGT GCCGGGCAGG ATATCGCTCT CCGGTCAGAC GGGACGCTGA AGGCACAAAA CAGCGTCATC AGCGCGGGCG TTGATGCGCA GGGCATCGTC AAGTCCGCCA ATCAGTTATC CCTTATCGGT GACGCTGTCA CGCTTGAGGC TACCCAGCTT GCCGCCGGCA AGGTGACGGT CAACGCGGGC CAGTCGCTGC AGCAGGATGC GGCAAGTGGT ATCAAAGCGG AGTCGGTGCT CGATATACGC GGCGACGCGG TGTCGCTGGC GGGCAGCGCG GGGGGTGAAG ACGTACGACT GGCGGCGAAG ACCCTTACCG GTGCCGGCAG CGCACAGCTT CAGGCGAAGA ATAATGCCAT GGTGCGCGTT ACGCAGCAGG GCGACTGGCA GGGAAACCTG ACCGCCGGTA ATGTGCTCAC CGTTGACGGC GGACGTCTGG TGCAGCGTGG CACGCTGGCG GGGAAAACCC TCACGCTGAC GCTGGATGCG CTGGATAATC AGGGCGATAT CGCCGCGCGG CAGGGGCTGA CGTTCAGCGG CGGCGACATG ACGAACCGCG GTACGCTGGC GGCGGCCGAG CGCCTGACGG TTAACGCGCA GCGTCTGGAC AACCTCGGTT TACTCAGCGC TCGCCATGAC GTGACGCTTG AGCTGCAAAC GGTGCTGAAC AATCAGGGCA ATATTCTGAC GGATAATCAG CTGTTTTTGC TGGCCGACAC CATCACCAAC GGTGGCACAC TGCAGGCCGA CAACGCACTG CAGCTTGAGG CCACGCGCGC CCTGACGCAA TCCGCGACCG GCTCACTGCT GGCTGGCACG 
GACCTGACGG TAAACGCCGG ACAGGCCGAG ACCGACGGTG CCATTCAGGC GCAGCAATTC CTGCTGAATG CCGCACGCTG GCTGAATTCG GGCAAAACCA GTCTTACCGG CGACGGACAG ATTACGGCCG CCCATCTGGA TAACCGCGGC AGCCTGTTGA CGGCGGGCAA CTGGACCATT CACAGCGATG TTGTCAGCCA GGCCGGGACG TTGCAGGGAA ATGCTCTGAC CCTTCAGGCC AATACTCTTA CGAGCAGCGG ACAGGCACAG GCGCAGGGCG CGGTGAACCT GACCGTCGCC GATGCCTTTA CCAATAGTGG CGACTGGCTC AGCGGAGAGT CGCTGCGTCT TCAGGCGGCG AAGACCGAGA ACCGGGGCAC GCTGCAGGCG CTAACCCTGA CGGCGGAGGG TTCGTCTTTC GACAACCGTG GCACGATTAG CGGTATCACT AACCTGTCGC TCTTCCTGAA CGGCAATCTG GCTAACAGTG GCACGCTGCA GGGCAATCAG CTTCATGCGG AGGCGGCACA GCTGACCAAT CAGGGCACGC TTCACGGCGC CGACGCACTG ACGCTGTCCA TCACCGGCAA TCTGACTAAT CAGGGAACAC TGCTGAGTGA CGGTGACAGC ACCACCACGG CTAAACTGTT CGATAATCAG GGTACGTTGC AGGCGAAAAA CGTCACGCTG CAGGTGGATG AACTGGATAA TGCGGGGAAC ATCTTCGGTG TGTCCTCACT GGTGCTGACG GCGGCCAGCG GGCTGACCAA CCGGGAAGCG GGGAAACTGC TGTCTCAGGG CGCGGCGGTG CTCACCGCGG CGGAGGTGGT GAATACCGGA GAGTGGCAGG CGAAAACGCT GACGCTGACG GCGAATAATC TCACCAATGA CGGCCAGATT CAGGGCGATG ACGCCTTGTC TTTGACGCTG CCGATGACCG ACGGCAAAGG GACGTTAATC AACCGCGGCA CGCTGACCAC GGGCGGTGAC GCCACGCTGT TTGCGCGCCT GATGGAGAAT CCGGGCACGC TGTCTAGTCG GGGTCAGACG ACATTGACGG GCATGTCGCT GATGAACGAC GGGAGAGTGG TTGCAGCGAC GGGACTGTCG CTGCGTGGCG ACTATCAGGG CCGTGGCCTG CTGAATACCG CGGGCACCCT GACGCTGGAT GGCGACACGC TGGTTAACGG CGGGCACTGG GAAAGCCGGG CGTTGTCCCT GCAAGGGAAG CACTTGACCA ATCAGGGCAC GGTGCTGGGG GACATGGTCG CGCTTTCTGT GGATCGTCTG GTTAACCACG GTGACATCAC CGGTGTTGAC ACGCTAACGC TGTCGCTTGG TGACAGCCTG AATAATATCG GCACGTTGCG CAGCGACAGC CTGTCGGTTG CGGCGGTGGA GGTGAACAAC CGCGGTGACA TGCAGGGCAT CAATACCCTG CAACTGAACA CCACCGGACT GCTGGATAAC ACGGGCGTCA TCAGCGGCAG CCAGTCTGTG GCGGTGACAG CCGGTGACGT CAGACAGGGC GGTACGCTGG AAGGGAAACG CGTCACGCTG GACGCCGCCT CGCTGGTCAA TCAGGGCAAA ATGCTGGGCG TGGACGCGCT GACGCTGTCG ATTGCCGGTA ATTTTTCCAA TGACGGCAAT CTGCTGACGC AGGGTAATGC AACGGTGACC GCGCAGAGCA TGGATAACCA CGGGCTGATG CAGGCGGTAA ACCTCACGCT GGAAGCGGAT GAGGTGACCA ATGCCGGGCA ACTGCTCGGT ATTCAGGCGC TGTCGGTAAC GGCACAGGGC GGGCTGACCA 
ATCAACAAAG CGGCAAACTG CTCACACAGG GGGCGGCGGT GTTACAGGCC GCCCAAGCGG AGAACCACGG CGAGTGGCTG GCGGATAATC TGACGCTGCA GACCGCCCGT TTCGCTAACA CGGGACGGAT TCAGGCCGAT CGCGATCTTG ATATAACGGT CACGCCCGCA AGTGCCGCAC GTCAGCGTTC ATTCCTGCCG ATGGCGCTGT CGCTGGCGGC AAATATTCAG CAACTCAACG CGTCATCTTC ACGCCAGGGC GACGGAGCCA CAGTCGGTGT ACTGGACAAC CGCGGCACGC TGGTGTCGGG CGGTGACCTG CAACTGCATG CCACGCAAAT CACCAATCAG GGCTCGCTTT CCGGTAACGG CACGGCAACC CTGACGGGCA ACACGATCCA GAATGACGGT GCCGTGGTCG CCCTGACATC CCTGTCGCTG AAGGGGAATT ATCAGGGTAG CGGGACGCTG CAAACGGACG GGCGGCTGGA CTGGTCCGGT ACGACGCTCA CCAACCGCGG GCGCTGGCAG GCGAACGCCA TTCAGTTGCA GGGAATAACG CTGGATAATC AGGGCACGCT GCTGGGGCAG CGCACTGACA TCACGGCCGA TAGCCTGTTT AACGGCGGCG AAATCGCAGG GGTTGACGCA CTGCAACTGA CCGTTGCCGA CCGTCTGACC AATCAGGGCC AGCTTTATGG CGCGACGCTG GGGCTGTCGG CGACCGGCCT GTTCAATCAG GGCGAACTCT CCGGTGACGA CCTGAACCTG ACGTTGCAGG AGGGGTTGCA CAACAGCGGT CTGATTAGCG GCAGCCAGCG GGTACAGCTG GAGGCGGAGC AGGTCGCACA GTCGGGTTCA CTGGAAAGTC GCCAGTTGCA GGTGCAGGCG AACGCACTGG ACAATCAGGG CACGATGCTG GGCATGGATG CACTGACGCT GGCGATTAAC ACCACGGCGC GCAACAGCGG GAAGTGGCTG AGTCAGGGCG ACAGCACGCT GACTGCCAGC CGACTGGAAA ACCGTGGGCA ATGGCAGGCG AAGACGCTGA CGCTGACGGC CGATGACGTC GAGAATGCCG GACAACTGCT GGGGCTGTCA TCGCTGTCGC TGACGGCGAA AAACAAGCTA AGCAACGCGC AGACTGGCAC ATTGCTTACC CAGGGACTCG CGGTATTGCG CGCGGCCAGT GCCGACAATG ACGGTGAATG GCAGGCGGAC AGCCTGACGC TGGACGCACA AAACCTGAAC AACCGTGGGC ACATTCAGGG TGATACATCG CTGAAGGCGA CGCTGGCAAA CGGCGACGTC ACCAATCAGG GCACACTGTG GAGCAAGAGT GCCGATATTG CAGCCCGTAC GCTGACCAAT GCGGGTGAGA TCACCGGCGT GAACGGTCTT CAACTGACAC TGGATGACGC CTTAACCAAT CAGGGCGCAC TGAGCAGCTA TCACCTGACC GCACAGGCAG GCAGCCTCGA CAATAGCGGC ACAATCACCG GCCTTGACCG GCTCGAACTG ACCGTTGGCA ATAGCCTTGA TAATCAGGGC ACGCTGTACG GCGCGGCGCG GCGCTGGCGC TGA
|
Protein sequence | MKPVKTTQRL LAYTLIHLIA FQPLLPAMAA GVQVATGNTA LDQAGNGVPV INIATPNSAG ISHNQYQDFN VDKPGLILNN GTAQLNPTQL GGLIQNNPNL KGKAADAIIN EVVSTNRSTL AGYLEVGGKQ ASVIVANPNG ITCDGCGFIN TPQVTLTTGK PQLDAQGNLQ HIDVTRGDIT LTGQGLDASK SDYLSLIART AQINAGLNAN DTQIVLGANQ VDATGKVTAQ AADSGVKVAL DTGALGGMYT NRIRLVSSDK GVGVNVGNLS ARSGDITLSA NGKLSLGDTV AQGNIQADAD ALALRGKQQA GDALMLSAKQ DITLQDATLR AGQDIALRSD GTLKAQNSVI SAGVDAQGIV KSANQLSLIG DAVTLEATQL AAGKVTVNAG QSLQQDAASG IKAESVLDIR GDAVSLAGSA GGEDVRLAAK TLTGAGSAQL QAKNNAMVRV TQQGDWQGNL TAGNVLTVDG GRLVQRGTLA GKTLTLTLDA LDNQGDIAAR QGLTFSGGDM TNRGTLAAAE RLTVNAQRLD NLGLLSARHD VTLELQTVLN NQGNILTDNQ LFLLADTITN GGTLQADNAL QLEATRALTQ SATGSLLAGT DLTVNAGQAE TDGAIQAQQF LLNAARWLNS GKTSLTGDGQ ITAAHLDNRG SLLTAGNWTI HSDVVSQAGT LQGNALTLQA NTLTSSGQAQ AQGAVNLTVA DAFTNSGDWL SGESLRLQAA KTENRGTLQA LTLTAEGSSF DNRGTISGIT NLSLFLNGNL ANSGTLQGNQ LHAEAAQLTN QGTLHGADAL TLSITGNLTN QGTLLSDGDS TTTAKLFDNQ GTLQAKNVTL QVDELDNAGN IFGVSSLVLT AASGLTNREA GKLLSQGAAV LTAAEVVNTG EWQAKTLTLT ANNLTNDGQI QGDDALSLTL PMTDGKGTLI NRGTLTTGGD ATLFARLMEN PGTLSSRGQT TLTGMSLMND GRVVAATGLS LRGDYQGRGL LNTAGTLTLD GDTLVNGGHW ESRALSLQGK HLTNQGTVLG DMVALSVDRL VNHGDITGVD TLTLSLGDSL NNIGTLRSDS LSVAAVEVNN RGDMQGINTL QLNTTGLLDN TGVISGSQSV AVTAGDVRQG GTLEGKRVTL DAASLVNQGK MLGVDALTLS IAGNFSNDGN LLTQGNATVT AQSMDNHGLM QAVNLTLEAD EVTNAGQLLG IQALSVTAQG GLTNQQSGKL LTQGAAVLQA AQAENHGEWL ADNLTLQTAR FANTGRIQAD RDLDITVTPA SAARQRSFLP MALSLAANIQ QLNASSSRQG DGATVGVLDN RGTLVSGGDL QLHATQITNQ GSLSGNGTAT LTGNTIQNDG AVVALTSLSL KGNYQGSGTL QTDGRLDWSG TTLTNRGRWQ ANAIQLQGIT LDNQGTLLGQ RTDITADSLF NGGEIAGVDA LQLTVADRLT NQGQLYGATL GLSATGLFNQ GELSGDDLNL TLQEGLHNSG LISGSQRVQL EAEQVAQSGS LESRQLQVQA NALDNQGTML GMDALTLAIN TTARNSGKWL SQGDSTLTAS RLENRGQWQA KTLTLTADDV ENAGQLLGLS SLSLTAKNKL SNAQTGTLLT QGLAVLRAAS ADNDGEWQAD SLTLDAQNLN NRGHIQGDTS LKATLANGDV TNQGTLWSKS ADIAARTLTN AGEITGVNGL QLTLDDALTN QGALSSYHLT AQAGSLDNSG TITGLDRLEL TVGNSLDNQG TLYGAARRWR
|
| |
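The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag PC1_2177.
length_bp = gene_length(2499259, 2504451)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # prints "5193 1730"
```

Under these assumptions the computed values agree with the Gene Length (5193 bp) and Protein Length (1730 aa) fields of the record.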