Gene Information |
Locus tag | PA14_64710 |
Symbol | |
ID | 4384820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 5762752 |
End bp | 5765721 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639327827 |
Product | putative extracellular heme-binding protein |
Protein accession | YP_793365 |
Protein GI | 116053046 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein |
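The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5762752
end_bp = 5765721

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (2970 bp) and Protein Length (989 aa) fields.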
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.572363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGGCG CCAACCCTGC GAGCGAAGAA GTCATGTCCC TACAACGCCC TATCCCCAGC CTGGCGCGTC CCTTGTCGCG CCTCTCCCCG CTCACCCTTG CCCTGCGCTG CGCCCTGCTC GGCCTGTCGG CGGTCGGCGC CGGCCTGCTC GGCGGCGGCC TGGCCCACGC GCAGAGCGTC GGCCAGCACA GCGCGGCTAC CCGCCTGGAC GACGCCACGC CGCGCAGCTA CGACATCCCG GCGGGTTCGC TGGGCGCCTC CCTGGGCGCC TTCGCCAGCC AGTCGGGCCT GCTGCTGTCC TTCGACCCCG CGCTGACCCG CGGCCGCACC GCGCCGGCGC TCAAGGGCCG CTACAGCGTA CTGGAAGGCT TGCAGCGAGT GCTGTCGAAC ACCGGCCTGC AGGTCGCGGC GGGAACCTCC GGCGCCTACC TGCTGGTCGA GCCGCGCATG TCCGGAGAAG CCCCGGCCGA CCTGTCGCCG GTGGTGGTCT CGGCCGCCGA GCTGGCCGAT CCGCAGAAGG AAACCTATAC CGCGCCGCGC TCGTCGGTGT ACCTCTCCAG CGAAGACATC GACCGCTTCG GCCGGGTTTC CGTGGGTGAC CTGCTCCAGG GCATCCCGGG CGTACAGGTC GGCGACAGCC GCAACGGTGG CGCGCTGGAC GTCAACATCC GCGGCATCCA GGGGCAGAGC CGGGTGGCGG TGCGGGTCGA CGGCGCGGAG CAGGCGCTGG ACGTCTACCG CGGCTACGCC GGCACCCAGC AGCGCAGCTA CATCGATCCC GACCTGGTCA GCAGCGTGAC CGTCGACAAG GGTCCCTCGA CCCGCTCGGG CGCGATCGGC GGCAGCGTGG AAATGCGCAC CATCGGCGTC AAGGACATCC TGGTCGACGG CAAGGACCTC GGCGTGCGCT TCACCGGCAA CGTCTGGAAC AACGGCGTCG CCCCGCAGCA CCGCAGCGCC AGTTCGAAGA CCGAGAATCT CAGCAGCGTG CCGCACGATG ACCGCGGCAG CCTGTTCGGC TCCCAGGCCA AGTCCGGCAG CGCCGCCTTC GCCTACCGCA ACGAGCATCT GGACCTGGTC GCCGCCTATG CGCAGCGCAA TCAGGGCAAC TACTTTTCCG GCAAGAAAGG CCAGGACCGC TATCGCGTCT ACAACCGCTA CGGTCGCGAG GAAAGCAGCG TAGCCAAGGT CTACAACGCA GGAGAGGAAG TGCTCAACTC GTCGTCGGAG ACCGAGTCCT ACCTGCTCAA GGCGACCTGG CGCATCGCCG ACGAGCACAC CCTGGACCTC GGCTATCGTC GCTACGACGG GCGCACCGGG GAGATCATGC CGTCGGACAT CTTCCGTTTC GGCACCGCCG GCATCTACCA GTACCCATTG AGCGAAGTGA AGATCGACAC CTACACCGCA CGCTACCGTT ACCTGCCCGA GAACAATCCG CTGGTGGACC TCAGTACCGG CCTGTGGATG ACCGAGGCGA AGAGCGACAT GCTGACCTCG GTGCTGGCAC CGCGCTCCCA GGCCTATCGC TCCGACCGCA ACTGGACGCG CCAGGACAAC CGGCGCATCG GCGGCGACCT GAACAACGTG GCGCGCTTCG AAACCGACTT CGGCGATTTC AAGCTCGACC TCGGCGGCTC GTTCCAGGTC GAAGACATCC AGCCGCAGAA AAGCGTGGTC ACCACCCTCC ACGACATCAA CGCCAACCGC ACCCTGAGGG ACGCCACCCG CCAGGAATAC GGCCTCAACG GCAAGCTCGA GTTCAAGCCG GTCGAGCGCC TGACGCTATG GGGCGGCGGC 
CGCTACAGCC ACTTCAACAG CAAGGACAAC GGCATCTCCG CCTCGCCGCG GCGCGAGGAT CGCGACATGC GCTTCATCAC GGTGAGCAGG CCCGGCTACT ACGGCTCGAT GATGTGGTTC CCCGACCAGA ACGGCCAGTA CACCGACGCC ACCGATCCGC GCCTCAACAA CGGCATCGTC ACCAACAACA CCAATAATCC GTTCGAAGGC ATTCCCTTCG ACGAGTTCGG CCCGGCCAAC GTGACGGTCC ATCCCTCGCG GGTCACCAAC GTGGTCACCG GCTACAACTA CAGCAAGAAG GGCAGCAGCC GCGGCGGCGG TTTCTCGCCG GCATTCGGGA TCAATTTCGA GCTGGCCCCG GATACCTTCG TCTACGCCTC CTACATCGAA GGCCTGCGCC TGCCATCGCT GTTCGAGACC AGCCAGGGCA CCCTCCAGGT CGAGCCGGGC AAGGAACTCA AGCCCGAGCG TTCGCGCAGC TGGGAGATCG GTGCCAGCGC GCTGCGCGAC AGCCTGTTGG CCGACGGCGA CTCGGCGGCG ATCAAGCTGG CCTACTTCAA CAACACCATC AAGAACTACA TCACCCGCTA TTACGATCCG GGCCAGATGG GCCTGATGAC CTTCAGCAAC ACCGACAGCT ACCGCACCAG CGGTCTCGAA CTGCAATCGC ACTACGACGC AGGGCGGGTG TTCGCCGACC TGTCGGCGAC CTACTACCTG AAGACCGAGA CCTGCGACGC CGCCTTCGCC GCCAGGCTGC GCGCCGGCGC CAACCGCTAC CAGCGCACCG AGAACACGCC GAACTGCACG CCGGGCAGCT TCATGGGCTC CTATACCAAC ACGCAGAACC CGCCGCGGCT GGCGACCAAC CTGACCGCCG GCCTGCGCTT CTTCGACCAG GCCCTGACCC TGGGCGGGCG CATGACCTAT ACCTCCGGCC CCACCGCCAC GGCGGACAAG CCCTGGCAGG TCGGCGCCAC CACACCGCAG ATCGAGTACC GCTCGGTGCA GCTGTTCGAC CTGTTCCTCA AGTACAAGCT GTTCGAGCAC ACCGAACTGA ATGCCTCGCT GCAGAACCTC ACCGACCGCT ATTACCTCGA CCCGCTGGCA CAGAGTTTCA TGCCCGCGCC CGGGCGTACC CTGCGGGTGG GGATGCAGGC GAAGTTCTGA
|
Protein sequence | MTGANPASEE VMSLQRPIPS LARPLSRLSP LTLALRCALL GLSAVGAGLL GGGLAHAQSV GQHSAATRLD DATPRSYDIP AGSLGASLGA FASQSGLLLS FDPALTRGRT APALKGRYSV LEGLQRVLSN TGLQVAAGTS GAYLLVEPRM SGEAPADLSP VVVSAAELAD PQKETYTAPR SSVYLSSEDI DRFGRVSVGD LLQGIPGVQV GDSRNGGALD VNIRGIQGQS RVAVRVDGAE QALDVYRGYA GTQQRSYIDP DLVSSVTVDK GPSTRSGAIG GSVEMRTIGV KDILVDGKDL GVRFTGNVWN NGVAPQHRSA SSKTENLSSV PHDDRGSLFG SQAKSGSAAF AYRNEHLDLV AAYAQRNQGN YFSGKKGQDR YRVYNRYGRE ESSVAKVYNA GEEVLNSSSE TESYLLKATW RIADEHTLDL GYRRYDGRTG EIMPSDIFRF GTAGIYQYPL SEVKIDTYTA RYRYLPENNP LVDLSTGLWM TEAKSDMLTS VLAPRSQAYR SDRNWTRQDN RRIGGDLNNV ARFETDFGDF KLDLGGSFQV EDIQPQKSVV TTLHDINANR TLRDATRQEY GLNGKLEFKP VERLTLWGGG RYSHFNSKDN GISASPRRED RDMRFITVSR PGYYGSMMWF PDQNGQYTDA TDPRLNNGIV TNNTNNPFEG IPFDEFGPAN VTVHPSRVTN VVTGYNYSKK GSSRGGGFSP AFGINFELAP DTFVYASYIE GLRLPSLFET SQGTLQVEPG KELKPERSRS WEIGASALRD SLLADGDSAA IKLAYFNNTI KNYITRYYDP GQMGLMTFSN TDSYRTSGLE LQSHYDAGRV FADLSATYYL KTETCDAAFA ARLRAGANRY QRTENTPNCT PGSFMGSYTN TQNPPRLATN LTAGLRFFDQ ALTLGGRMTY TSGPTATADK PWQVGATTPQ IEYRSVQLFD LFLKYKLFEH TELNASLQNL TDRYYLDPLA QSFMPAPGRT LRVGMQAKF