Gene PA14_47800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_47800 
Symbol 
ID4384691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp4250942 
End bp4252792 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content67% 
IMG OID639326409 
Productputative tonB-dependent receptor 
Protein accessionYP_791974 
Protein GI116049223 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR01779] TonB-dependent vitamin B12 receptor 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.173562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAG TTTTCCTGAC CCCGGCAGCC GTGGCGCTGT GCGGCGCGTC TTCCCTGAGC 
CTGGCCGAAC CGGTCAGCCT TGTCGACCAG GTGGTGACCG CCACCCGTAC CGCCCAGACC
GCTTCCCAGA GCCTGGCGGC GGTGAGCGTC ATCGACCGCG AGGATATCGA GCGCAGCCAG
GCGCGCAGCG TGCCGGAGCT GTTGCGCCAG GTACCCGGCG TGTCGCTGGC GAACAACGGC
GGTTTCGGCA AGAACACCAC GCTGTTCCTG CGCGGCACCG AGTCCGACCA TGTGCTGGTG
CTGATCGACG GCATCAAGGT CGGCTCGGCC AGCGCCGGCC TCACCGCGTT CCAGGACTTG
CCGGTGGAGC TGATCGAGCG CATCGAGGTG GTCCGCGGGC CGCGTTCCAG CCTGTACGGC
TCGGAGGCCA TCGGCGGGGT GATCCAGATC TTCACCCGGC GCGGCGACGG CCAGGGCGCC
AAGCCGTTCT TCTCCGCCGG TTACGGCACC CATCAGACCC TGGAGGGCAG CGCCGGGGTC
AGCGGCGGCG CTGGCAACGG CTGGTACAGC CTCGGGGTGA GCAGTTTCGA TACGGCGGGG
ATCAACACCA AGCGCGCCGG TACTGCGGGC TATGAGCCAG ACCGAGACGG CTACCGCAAC
CTGTCCGGCA ACCTGCGCGG CGGCTATCGC TTCGACAATG GCCTGGAACT CGACGGCACC
CTGCTCAGGG CCAAGTCGCA CAACGACTAT GACCAGGTCT TCGGCAACTC CGGTTTCAAT
GCCAACGCCG ACGGCGAGCA GAACCTGGTC GGCGGCCGTG CCCGCTTCAC TCCGTTCGAT
CCCTGGCTGG TGACCCTCCA GGCCGGGCGC AGCGAGGACA AGGCCGATGC CTATCAGGAT
GGCCGTTTCT ACTCGCGCTT CGATACCCGT CGCGACAGCC TGTCCTGGCA GAACGACCTG
ACCCTGGCCG AAGGCCATGT GCTGACCCTC GGCTACGACT GGCAGAAGGA CGAGATCAGC
AGCAGCGAAG CCTTCAGCGT CGACTCGCGG CTGAACAAAG GCTGGTTCGC CCAGTACCTC
GGCCGGTACG GTCGCCAGGA TTGGCAACTG AGCCTGCGCC GCGACGACAA CCAGCAGTTC
GGCGTGCACG ATACCGGCAG CGCCGCCTGG GGCTACGCGC TGAGCGACGC GCTGCGCTTC
ACCGTCAGCT ACGGCACGGC GTTCAAGGCG CCGACCTTCA ACGAACTCTA CTACCCCGAC
TACGGCAATC CCGACCTGGA CGCCGAGACT TCGCGCAGCC TGGAAGTCGG GCTGAGCGGT
ACGCATGGCT GGGGGCACTG GGCGGTGAAT GCCTTCCGTA CCAACGTCGA CGACCTGATC
GGCAACGATC CGCGTCCGGC GCCGGGGCGC CCCTGGGGGC AGCCGAACAA CATCGACGAA
GCGCGCATCC GTGGCGTCGA ACTGGTCCTC GGCAGCCAGT GGCTGGGCTG GGACTGGAAC
GCCAACGCAA CCTTCCTCGA CCCGCAAAAC CGTTCCGGCG GCGTCAACGA CGGCAACGAG
CTGCCGCGCC GGGCGCGGCG GATGTTCAAC CTGGAACTGG ACCGGCGCTT CGAGCGTCTT
TCGCTGGGCG CCAGCGTGCA CGCCGAAGGC CGACGCTATG ATGACCCGGC CAACAAGGTG
CGCCTGGGCG GCTACGCCAC CCTCGACCTG CGCAGCGAGT ACCGGCTGAA CGACGAATGG
CGCCTGCAGG GCCGGATCGC CAACCTGTTC GGTGCCGACT ACGAAACCGC GTATGGCTAC
AACCAGCCTG GCCAGGCGGT CTACCTCAGC GTGCGCTACC AGGCCCTGTG A
 
Protein sequence
MNRVFLTPAA VALCGASSLS LAEPVSLVDQ VVTATRTAQT ASQSLAAVSV IDREDIERSQ 
ARSVPELLRQ VPGVSLANNG GFGKNTTLFL RGTESDHVLV LIDGIKVGSA SAGLTAFQDL
PVELIERIEV VRGPRSSLYG SEAIGGVIQI FTRRGDGQGA KPFFSAGYGT HQTLEGSAGV
SGGAGNGWYS LGVSSFDTAG INTKRAGTAG YEPDRDGYRN LSGNLRGGYR FDNGLELDGT
LLRAKSHNDY DQVFGNSGFN ANADGEQNLV GGRARFTPFD PWLVTLQAGR SEDKADAYQD
GRFYSRFDTR RDSLSWQNDL TLAEGHVLTL GYDWQKDEIS SSEAFSVDSR LNKGWFAQYL
GRYGRQDWQL SLRRDDNQQF GVHDTGSAAW GYALSDALRF TVSYGTAFKA PTFNELYYPD
YGNPDLDAET SRSLEVGLSG THGWGHWAVN AFRTNVDDLI GNDPRPAPGR PWGQPNNIDE
ARIRGVELVL GSQWLGWDWN ANATFLDPQN RSGGVNDGNE LPRRARRMFN LELDRRFERL
SLGASVHAEG RRYDDPANKV RLGGYATLDL RSEYRLNDEW RLQGRIANLF GADYETAYGY
NQPGQAVYLS VRYQAL