Gene Information |
Locus tag | PA14_20010 |
Symbol | hasR |
ID | 4381373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 1721937 |
End bp | 1724612 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639324152 |
Product | heme uptake outer membrane receptor HasR precursor |
Protein accession | YP_789740 |
Protein GI | 116051427 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein; [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein |
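
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention, but both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1721937
end_bp = 1724612

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the final codon being a stop
# codon that contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # compare with "Gene Length"
print(protein_length_aa)  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2676 bp) and Protein Length (891 aa) fields.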
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0486368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATC GTGGATGGAG TGCCGTGCGA GGCGGTCGCA AGGGGGCGCA ACTGGCCCTG GGGCTGGGCC TGGTCCTGCT GGGAACGGCC GCGCTACCGC TGCATGCGCA AGACGGGGCG GACTCGGCGA GCCAGCAGCA GACCGCGCTG CGCCGGGTCC GGCTGGACAT CCCGGCACAA CCGCTGAACC GCGCCCTGCT GCGATTCGCC GAGCAGGCCG GGGTCCAGGT GTTCTTCGAC AGCCAGCGTT TCGCCGGTCT CGGCAGCGCA GCGGTGCACG GCGAATATGT GCTGGCCGAC GGCCTGAGCC AGATGCTCCA GGGCAGCCCG GTGGAATACC GCTTCTCCGG CAAGGACCAA TTGAGCCTGA TCCGCGTCAG CCAGGACGAC CTGGTGCAGA TGTCGCCCTC GGTGATCTCC GCCGCGCGTC CGGACGACTG GGTCTACCAG ACGCCGCATT CGGTCAGCGT GATCGGCCGC GAGCAGATCG AGCGCAACCC GCCGCGGCAT GCCGCCGACA TGCTCGAGGA AACCCCCGGG GTGTACTCCT CGGTGAGCCA GCAGGACCCC GGCCTGTCGG TGAACATCCG TGGCATCCAG GACTATGGGC GGGTCAACAT GTCGGTCGAC GGCATGCGCC AGAACTACCA GCAGAGCGGC CACCAGCAGC GCAACGGGAC GCTCTACGTC GATCCCGAGC TGCTCAGCGA GGTGGTCATC GACAAGGGCG CCAGCTCGGC CATGGGTGGC GCCGGGGTGA TCGGCGGGAT CGCCAACTTC CGCACCCTGG AGGCCCGCGA CCTGGTCAGG CCGGGCAAGC AGGTCGGTGG CCGGGTGCGC CTCACCAGCG GCCTGGGCGG CGACGCCAAC GGCACCCACT TCATCGGCAG CGCGGCCTTC GCCATCGGCA CCGAGGTCTG GGACATGCTG GTCGCCGCCA GCGAGCGCCA CCTGGGCGAC TACGACCCGG GGACCAAGGG CAGCATTGGC GAGCTGCGCA CCGGCGCCTG GTTCAATCCC GAGGCCGGGC AGCGGGTCAA GCATTCGCCG GTGGCCTACT CAGGCTATGT GATGCGCTCG CGACTGGCCA AGCTTGGCGT CGCCCTGCCG CAGGACCAGC GCCTGCAGTT CAGCTACCTG ACCACCCAGG TGTCCTACGA CGACGCCAAC ATGCTCAACA CCGAGAACCA GGCGCTCTGG GAAAAGCTCG GCAGCAGCGA CGTGCGTGCG CAGAACTTCG CCATCGACTA CGGCTACGCG CCGGACAACC CGTTGGTGGA CTTCAAGGCC AAGCTCTACT ACGTCGACAA CCGCAACCGC CAGCAGACCC TGCAACGCGG TATCACCCCC GGCTACTCGA TCACCTACCA GACCGACACC TACGGCGCGC AGGCGCAGAA CACCTCGACC TTCGCTCTCG ACGATCTCTC CACGCTGCGC GCCAACTATG GCCTGGAGTT CTTCTACGAC AAGGTGCGGC CGGACTCCAG CCAGCCGCGG GCGAGCACCT CGGCGGTCGG CTTCCCTGCG GCCGAAGGCA TGACCCCCAA GGGCGACCGC GCCCTCGGTA GCCTGTTCGC CCGTCTCGAC TACGACTACG ACGACTGGCT CAACCTCAAC GCCGGGCTGC GTTACGACCG CTACCGCCTG CGCGGCGACA CCGGCTTCAA CGCGCGCACC TTCATCCTTG GCACCACCCG GCAGACCGAC ATGCCGCTGC AATACGCCGT CGACCGCGAG GAGGGGCGCT TCTCGCCGAC CTTCGGCCTG TCGGTCAAGC CCGGCGTCGA CTGGTTGCAG 
TTGTTCGCCA CCTACGGCAA GGGCTGGCGC CCGCCGGCGG TGACCGAGAG CCTGATCACC GGCCGCCCCC ACGGCGGCGG CGCGGAAAAC ATGTACCCCA ACCCGTTCCT CAGTCCGGAG CGCTCGAAGG CCTGGGAAGT CGGCTTCAAC GTGCTGAAGG AGAACCTCTG GTTCAGCGAC GACCGCCTGG GCCTGAAGGT CGCCTACTTC GACACCCGGG TCGACGACTT CATCTTCATG GGCATGGGCA TGCAGCCGCC GGGCTACGGC ATGGCCGGGA TCGGCAACAG CGCCTACGTC AACAACCTCG ACAGCACGCG CTTCCGTGGC GTCGAGTACC AGCTCGACTA CGATGCCGGG CTGGCCTACG GGCAGCTCTC CTACACGCAC ATGATCGGCA GCAACGACTT CTGTTCGAAG ACCGCCTGGC TCGGTGGCGT CACCCAGACG GTGAAGGGCA GCGGTCGCCG CCCGCCGGTG ATCGACATGC GGCCGGACGA GCAGGCCAAC GCCGCCACCC ATTGCAGCGC GGTGCTCGGC TCCGCCGAAC ACATGCCGAT GGACCGCGGC TCGCTGACCC TGGGCATGCG CTTCTTCGAC CGCAGGCTGG ACGTCGGCGC CCGTGCCCGC TACAGCGAAG GCTACTCGGT GGCCGGCGGC GCCACGGTGT CGCAGGCCGG CGTGTACCCG GCGGACTGGA AGGAATACAC CGTCTACGAC CTGTACGGCA GCTACCGGGT GAGCGACGAG CTGACCCTGC GCCTGGCCAT GGAGAACGTC ACCGACCGCG CCTACCTGGT GCCGCTGGGC GACGTGCTGG CCTTCACCCT CGGCCGCGGG CGGACCCTGC AAGGCACCCT CGAATACCAG TTCTGA
|
Protein sequence | MKHRGWSAVR GGRKGAQLAL GLGLVLLGTA ALPLHAQDGA DSASQQQTAL RRVRLDIPAQ PLNRALLRFA EQAGVQVFFD SQRFAGLGSA AVHGEYVLAD GLSQMLQGSP VEYRFSGKDQ LSLIRVSQDD LVQMSPSVIS AARPDDWVYQ TPHSVSVIGR EQIERNPPRH AADMLEETPG VYSSVSQQDP GLSVNIRGIQ DYGRVNMSVD GMRQNYQQSG HQQRNGTLYV DPELLSEVVI DKGASSAMGG AGVIGGIANF RTLEARDLVR PGKQVGGRVR LTSGLGGDAN GTHFIGSAAF AIGTEVWDML VAASERHLGD YDPGTKGSIG ELRTGAWFNP EAGQRVKHSP VAYSGYVMRS RLAKLGVALP QDQRLQFSYL TTQVSYDDAN MLNTENQALW EKLGSSDVRA QNFAIDYGYA PDNPLVDFKA KLYYVDNRNR QQTLQRGITP GYSITYQTDT YGAQAQNTST FALDDLSTLR ANYGLEFFYD KVRPDSSQPR ASTSAVGFPA AEGMTPKGDR ALGSLFARLD YDYDDWLNLN AGLRYDRYRL RGDTGFNART FILGTTRQTD MPLQYAVDRE EGRFSPTFGL SVKPGVDWLQ LFATYGKGWR PPAVTESLIT GRPHGGGAEN MYPNPFLSPE RSKAWEVGFN VLKENLWFSD DRLGLKVAYF DTRVDDFIFM GMGMQPPGYG MAGIGNSAYV NNLDSTRFRG VEYQLDYDAG LAYGQLSYTH MIGSNDFCSK TAWLGGVTQT VKGSGRRPPV IDMRPDEQAN AATHCSAVLG SAEHMPMDRG SLTLGMRFFD RRLDVGARAR YSEGYSVAGG ATVSQAGVYP ADWKEYTVYD LYGSYRVSDE LTLRLAMENV TDRAYLVPLG DVLAFTLGRG RTLQGTLEYQ F
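
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The `translate` helper and the two excerpts are illustrative only; the excerpts are the first three 10-base blocks and the final codons of the gene sequence above.

```python
from itertools import product

# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in this record)
    is ignored. The stop codon itself is not included in the result.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (three 10-base blocks).
first_blocks = "ATGAAACATC GTGGATGGAG TGCCGTGCGA"
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
last_codons = "GAATACCAG TTCTGA"

print(translate(first_blocks))  # MKHRGWSAVR
print(translate(last_codons))   # EYQF
```

The first result matches the opening block of the protein sequence, and the second matches its final four residues.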
|
| |