Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_58900 |
Symbol | |
ID | 4382663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 5246954 |
End bp | 5251207 |
Gene Length | 4254 bp |
Protein Length | 1417 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639327336 |
Product | large exoprotein |
Protein accession | YP_792888 |
Protein GI | 116052573 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0117693 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA GCTATACGCT GGTCTGGAAC CAGGCCACAG GCTGTTGGAA CGTCGCAAGC GAAGGTACCC GTCGGCGCAG CAAGAGCGGA CGCGGCAAGG CGCTCGTAGT CGCCGGAGCG TCACTGCTCG GCCTGTTCTG CCAGGCCCCC GCCTTCGCCC TGCCCAGCGG CGCCACGGTC GTTTCAGGCG ATGCCGGATT CCAGACATCC ACCGATGGCC GGCATATGGT CATCGACCAG CAGAGCCACA AGCTGATCAC TAATTGGAAC GAGTTCAGCG TCCGTGCCGA TGAGCGGGTC AGCTTCCACC AGCCGGGCCA GGACGCCGTC GCCCTGAACC GGGTGATCGG CCGCAACGGC AGCGATATCC AGGGGCGGAT AGATGCCAAC GGCAAGGTCT TCCTGGTCAA TCCCAACGGT GTGGTCTTCG GCAAGTCCGC CCAGGTCAAC GTAGGCGGCC TGGTGGCTTC CACCCTGGAC CTGGCCGACA GGGACTTCCT CGCCGGCAAC TACCAGTTCT CCGGCGACTC CGGCGCAACC GTAAGCAATG CCGGCAGCCT GAAAGCCAGC GAAGGCGGCA GCATCGCCCT GCTGGGCGCC CGGGTCAGCA ACGACGGCGT GATCCAGGCG CAACTCGGCG CCGTGGCCCT GGGCGCAGGC CAGGGCATCA ACCTCAATTT CGACGGCGAC GGCCTGCTCA ACCTGCAGGT GGACAAGGGC TCGGTCGACG CTCTTGCACA CAACGGCGGC CTCATCCGCG CCGATGGCGG CCAGGTGCTG ATGAGCGCCC GCAGCGCCGA CAGCCTGCTC AAGACCGTCG TCAACAACCA GGGCACTCTC GAGGCCAGGA CGCTACGCAG CGCGGAAGGA CGCATCGTCC TCGACGGCGG CGAACAGGGT ACCGTGCGGG TGGCCGGCAA GCAGGACGCC AGCGCCATCG GCGGAGGCAA TGGCGGCCTG GTGCTGAACC AGGGCGCGAA CGTCGAGATA CAGCGAACCG CGCAGGTGGA CACCCATGCC GACCAGGGCG CAACCGGCAC CTGGAGGATT CTCTCGCACG AGGTCAGCGT AGCCGCTGTC GGCCAGGCAA ACGCTGCCGG TGATGGTTCC GGCCAGGTCC ATGTAGCGCA GGGCCCAGCC GGGGCCAATG CGTCCGATAG CAACGGCGTG ACCATCGTTC AGCAGCAGCC GGCCGTCGAC CTCGCCGCCG GCGCCAACGG TACCTCCGCA GTGCAGAGCC AGAGCGGCGC CAACATCGGC TCGGGCGCAA GTGGCATCAG CGTCGTGCAA AGCCAGAACA GCCCCAATAT CGGCTCGGGC GCCAATGGCA TCAGCGTCGT GCAAAGCCAG AATGGCGCCA ACATCGGCGC CGGCGCGAGT GGCATCAGCG TCGTGCAGAG CCAGAACAGC CCCAATATCG GCTCGGGCGT CAATGGCGTG ACTGTCGTGC AGAGCCAGAA CGGTGCCAAT ATCGGTTCGG GCGCAAGTGG CATCACCGTT GTGCAAAGCC AGAATGGCGC AAATATCGGT TCAGGCGCGA GTGGCATCAG CGTCGTGCAG AGCCAGAGCG GCCCCAGCAT CGGCTCGGGC GTCAATGGCG TCACAATCGT GCAGAGCCAG AGCGGTGCCA ACATCGGCCC CGGCGTCAGC GGAATCGATG TCGTCCAGAC CCAGACTCTC CCCAACCTGA GCCCAGGCGC CAATGGCTCC AGCATCGTCC AGGTCCAGAC GCTACCCGAT ATTGCCGCCG ACGCCGGCAA TGTGCATGTC GTCCAGGTCC AGACCGGCGG TAACAAGGTC TTCGGCAATT CCGCCACCAA CGTCAGGTCA CGTACCGTTC AGGCCCGGAG CAGCGAGAAT GTCGGTTCCG GCCTGGCGAA TCCAAGCAGC GCGGGAAAAG GTCCGACGCT GCATGCCGAT ACCCTGGCCC GCAACCTTTC CACAAGCAAC GTCGAAGTGG TCGCCACCCG GGGCAACGCG CATGTCGGCG CGCCGCTGTC CTGGGACAGC GGCAACGGCC TGACGCTAAC CGCCGAGCGC GGGGACCTCA GGATCAATGG CGCGCTGACG GCCCAGGGGG AAAACGCCAG CCTTACTCTC AATGCCGGGC AGCGCCCTCT CCGTATCGAC GACAGCCTCT CTCTCACTGG CCAGGGAGCC CGGGTCGAAT TCAATTCGGA CAAGGGTTAT GCCCTCGCCG AAGGCGCCCG GATCACCCTG TCCGGCAAGA ACGCAGGATT CCGCGCCAAT GGGCGGGACT ACAGCGTGAT CCAGGACCTG CAGCAGTTGC GCGGCATCGA TAGGGACCTG GGCGGCAGCT ATGTCCTCGG CAATCGAATC GCAGGAGGCA ATTCCAGCTT CCTGTCGATA GGCAACGCCA GCGCCTTCGG CGGTACCTTC GACGGCCTGG GCAACACCAT CGATAATCTT GCCGTGTACG GCACCGGTGC CTACTCCGGC CTGTTCAGCG TCAACCGGGG CACCCTCCGC AACCTGAACC TGGAACGCAT TTCCGCCGAT GGAGCACAGG CCACCCACTA CAACGTCCAG GTCGGTAGCC TGGCCGCCGT CAACCTCGGT CGCATCGACA ATGTGAACGC CAGCGACATC CGTATCGCCG CGGCCTCGAA GCTGAACAGC CTCGGCGGGC TGGTCGCACT GAACCTGGGT AGTATCGACA ACGCCAGCGC CAGCGGCACG CTGGTCGGCA ACCGCCACAC CTATGCTCTG GGCGGACTCG CAGCCGAAAA CATCAGCACA GCCAGGGGCG TGGCCAGCAT CTCCAACAGC CGGGCCGATT TTGCCATCTC CGGCCAGTTG AAGGACCATG CCAGCCACTA CGGCGCGGGG GGCCTGGTAG GCAGGAACCG CGGCGGCCTC ATCCGCAGCA GCGGCAGCCA GGGAACGCTG TCGCTGAGCG GTCACGGGAT GAACCTGGGA GGACTGGTCG GATACAGCTC CGCCGGCGGA CTGGCGGACG TATCCGCCTT CGTCGACGTC TCAGGCAACG GACAGCACGG CCTGTACGGA GGGCTCATCG GCCTCAACGT AAACAGTGGT ATCGCCCACG CCACGGCCAG CGGCAAGGTC CGGGGCACAG ACGCGGAAGC ACTGGGCGGG CTGATCGGCC GGAACCTGAA CGCGGCCATC ACCAACGCCA GCGCCCATGG CGACGTCGTC CTGCAAGCCG GTCGCTACTT GGGAGGCCTG ATCGGCCACA ACCAGGCAGG CAACCTGGCC GACGTCAGTG CCAGCGGCAA CCTGAGTGGT GGGTCGCTGC TCCAGGCCGG CGGCCTGATC GGTCTCAACG CCAATGCCTC GCTGGTCAAT GCCTCCGCCA AGGGCAACGT CGCTACCCGC GGAGCAGAAG CGGTTGGCGG TCTGCTCGGC GAAAACCTGT ACGGCTCCAT CATCAACGGT TCCGCCAGTG GCGAAGTCAC CGACGGCAGC GGCAAAACCC TGGGTGGCCT GATAGGGTCC AACCTCGGCG GCAATCATTC CAACCTGAAG GCCTCCGGGT GGGTAAACGC AGGGGCGAAC AGTGACGTGG GAGGGCTGAT CGGCCACAAC CGGGGCGGCA ACCACAGCAC CCTGGCCGCA TCCGGCAATG TCACCGGGGG CAAGGGCAGT CGCGTCGGCG GACTCGTCGG CTACAACGAT GCCGCCTCGC TGACGAACGT CTCGGCTTCG GGCAACGTCA GCGCCAACGG TTCCAGGGCC ATCGGCGGGT TGCTCGGCAG CGATCTGCGA GGCTCGCTGA TGCTCGCCAG CAGCTATGGG ACGGTGATCG ACATGACCGG CCACAACCTG GGAGGACTGC TCGGCCGCGG TGAAAATACC TCGATCCGCT CCGCCAACGC CACCGGCGCG GTCACCGGGG GCGGCGGCGC CAGTGTCGGC GGGCTGGTCG GCTCCCTGGA AGGCTGGCGG GCTCTCGTCC TGGGGGCCTC GGCCAGCGGC GATGCGAGGG CGGGATATGA CAGCTATATC GGCGGGCTGG CAGGCTTCAG CACCGGCACC ATCAGGGGCG CCTCCGCTTC CGGCAAGGTC GGAGGCTCGG GCCTGCTGGG CGGCCTGGTC GCCTGGAACC AGGGGAATGT CATGGGTTCT TCGGCCAGCG GCAGGCTGGA GCCGCAAATC CCCAACCAGA TCCATGGCGG ACTGATCGGC ATCAATTTTG GCTGGCAGTC CTGGAACTCG GTATACGGGG CTGCGGCGAC CGTTCCCATG ATAGGTCGCC ACTACAACCT GTGA
|
Protein sequence | MNKSYTLVWN QATGCWNVAS EGTRRRSKSG RGKALVVAGA SLLGLFCQAP AFALPSGATV VSGDAGFQTS TDGRHMVIDQ QSHKLITNWN EFSVRADERV SFHQPGQDAV ALNRVIGRNG SDIQGRIDAN GKVFLVNPNG VVFGKSAQVN VGGLVASTLD LADRDFLAGN YQFSGDSGAT VSNAGSLKAS EGGSIALLGA RVSNDGVIQA QLGAVALGAG QGINLNFDGD GLLNLQVDKG SVDALAHNGG LIRADGGQVL MSARSADSLL KTVVNNQGTL EARTLRSAEG RIVLDGGEQG TVRVAGKQDA SAIGGGNGGL VLNQGANVEI QRTAQVDTHA DQGATGTWRI LSHEVSVAAV GQANAAGDGS GQVHVAQGPA GANASDSNGV TIVQQQPAVD LAAGANGTSA VQSQSGANIG SGASGISVVQ SQNSPNIGSG ANGISVVQSQ NGANIGAGAS GISVVQSQNS PNIGSGVNGV TVVQSQNGAN IGSGASGITV VQSQNGANIG SGASGISVVQ SQSGPSIGSG VNGVTIVQSQ SGANIGPGVS GIDVVQTQTL PNLSPGANGS SIVQVQTLPD IAADAGNVHV VQVQTGGNKV FGNSATNVRS RTVQARSSEN VGSGLANPSS AGKGPTLHAD TLARNLSTSN VEVVATRGNA HVGAPLSWDS GNGLTLTAER GDLRINGALT AQGENASLTL NAGQRPLRID DSLSLTGQGA RVEFNSDKGY ALAEGARITL SGKNAGFRAN GRDYSVIQDL QQLRGIDRDL GGSYVLGNRI AGGNSSFLSI GNASAFGGTF DGLGNTIDNL AVYGTGAYSG LFSVNRGTLR NLNLERISAD GAQATHYNVQ VGSLAAVNLG RIDNVNASDI RIAAASKLNS LGGLVALNLG SIDNASASGT LVGNRHTYAL GGLAAENIST ARGVASISNS RADFAISGQL KDHASHYGAG GLVGRNRGGL IRSSGSQGTL SLSGHGMNLG GLVGYSSAGG LADVSAFVDV SGNGQHGLYG GLIGLNVNSG IAHATASGKV RGTDAEALGG LIGRNLNAAI TNASAHGDVV LQAGRYLGGL IGHNQAGNLA DVSASGNLSG GSLLQAGGLI GLNANASLVN ASAKGNVATR GAEAVGGLLG ENLYGSIING SASGEVTDGS GKTLGGLIGS NLGGNHSNLK ASGWVNAGAN SDVGGLIGHN RGGNHSTLAA SGNVTGGKGS RVGGLVGYND AASLTNVSAS GNVSANGSRA IGGLLGSDLR GSLMLASSYG TVIDMTGHNL GGLLGRGENT SIRSANATGA VTGGGGASVG GLVGSLEGWR ALVLGASASG DARAGYDSYI GGLAGFSTGT IRGASASGKV GGSGLLGGLV AWNQGNVMGS SASGRLEPQI PNQIHGGLIG INFGWQSWNS VYGAAATVPM IGRHYNL
|
| |