Gene PA14_52020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_52020 
Symbol 
ID4380023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp4616482 
End bp4617519 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content68% 
IMG OID639326766 
Producthypothetical protein 
Protein accessionYP_792329 
Protein GI116048870 
COG category[S] Function unknown 
COG ID[COG3249] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00203193 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCATAG CCCGACTCTT CGTTCTCTGT TTTTCCCTGC TAGGCTTGCC CGTCTTCGCG 
GCAACGGTCC CCAATCTCTA CCAGGTCCAC GAGCCGGTTT CGTCGCAGCA GCCCGGGGAG
CGCGATGCCG GACTGGTGCG AGCCCTGCAG ACCCTGCTGG TGCGCCTGAC CGGCAATCCG
CAGGCGCCGC AGAACCCGGC ATTGGCGGGG TACCTGAAGG ATCCGCAGCA ACTGATCAGC
CAGTACGCCT TCGAAAATGG TCCGCCGCTG GCGCTGGTGG TCGATTTCGA TCCAACCGCC
ACCGGTAATG CGCTGCGTGC CGCCGGCCTG CCGAGCTGGG GCGCCAACCG CCCGGCGGTG
CTGGCCTGGT GGCTGAACGA AAGCGCCGAT GGCAGCACCC TGGTCGGTGA CAACCAGGCC
TCGGCCGAAC CGCTCAAGCG TGCGGCGCAG AACCGCGGCT TGCCGTTGCG CCTGCCTCTG
GCGGATCTCG ACGAACAGAT CGTCGGTACC CCGGAGAACC TCACCGCCGC CCAACCCGAT
GCCCTGCGCG CAGCCTCCGA GCGTTATGCC GCCGATGCCT TGCTGGCAGT GGACGCCAAG
GAGGCGGACG GCAAATGGCA GGCGCAATGG CGGCTGTGGA TGGGCGATTC GCGGGAGCAA
GGCCAGGCTG AAGGCGCTAC GCCCGACGCG TTGGCAGACA GCGTGATGCT GGCCGTCGGC
AACCGCCTGT CTACCCGTTT CGTTGCCACG CCGGGAGCGG CGACCGGCCT GACCCTCCAG
GTCCAGGGCG CGACACTGGC ACGCTATGCC GAGTTGCAAC GCCTGCTCGA TCCGTTCGGC
GCGCGTCTGG TAGGCGTGCG GGGCGATCGC CTCGACTATT CCGTGAAGGC CAGTCCCGAG
CAATTACGTG CCCAGCTGGG CCTGGCGCAG TTGCAGGAAA TCCCGGCCGA CAGCGTACCG
CTGGATGCCT CCGGCCAGCC CGCAGCGCCC AGCGCGGCGG TGCCGTCGTC GTCCCAACTG
AATTTCCGCT GGCAGTGA
 
Protein sequence
MRIARLFVLC FSLLGLPVFA ATVPNLYQVH EPVSSQQPGE RDAGLVRALQ TLLVRLTGNP 
QAPQNPALAG YLKDPQQLIS QYAFENGPPL ALVVDFDPTA TGNALRAAGL PSWGANRPAV
LAWWLNESAD GSTLVGDNQA SAEPLKRAAQ NRGLPLRLPL ADLDEQIVGT PENLTAAQPD
ALRAASERYA ADALLAVDAK EADGKWQAQW RLWMGDSREQ GQAEGATPDA LADSVMLAVG
NRLSTRFVAT PGAATGLTLQ VQGATLARYA ELQRLLDPFG ARLVGVRGDR LDYSVKASPE
QLRAQLGLAQ LQEIPADSVP LDASGQPAAP SAAVPSSSQL NFRWQ