Gene PA14_55220 details


Gene Information

Locus tag: PA14_55220
Symbol:
ID: 4382550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 4903681
End bp: 4904997
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 71%
IMG OID: 639327033
Product: MFS family transporter
Protein accession: YP_792591
Protein GI: 116048610
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
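
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4903681, 4904997)
print(length, protein_length(length))  # prints "1317 438"
```

Both values match the Gene Length and Protein Length fields of this record.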


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.872465
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00000012197
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCTCCC CCACCCGTCC CCCGGCGGCC GCCGATCCGG CCAGCCGCCG CAGCGTCTTT 
GCCGTGGTCC TCGGCAACGC CGTGGAATTC TTCGACTTCG GGGTCTATGC CACCTTCGCG
GTGATGATCG GACGGACTTT CTTCCCCTCC GACAGCGCCT TCGTCAGCCT GCTCCTGTCG
GTCACCGCGT TCGGCGTCGG CTTCGTCATC CGCCCCCTCG GCGCGATCCT TATCGGCGCC
TACGCCGACC GCGCCGGGCG CAAGCCGGCG ATGCTCCTTA CCCTGTTCCT GATGGCGCTG
GGCACCGGCG GCATCGCGGT GCTCCCCGGC TACGACAGCA TCGGCCCGGC CGCGCCGCTG
CTGCTGGTGT TGACCCGCCT GCTGCAAGGC CTGGCCTGGG GCGGCGAGGC CGGGCCGGCG
ACCACCTACA TCCTCGAGGC GGCACCGCCG CACAAGCGCG GCACCTACGC CTGCTGGCAG
GTCGTGGCGC AAGGCATCGC GGCGGTCGCC GCCGGGCTGA TGGGCTACCT GCTCACCCTC
TGGCTCGACG AGCGCCAACT GCAGGAATGG GGCTGGCGGA TTCCCTTCGC GGCCGGCCTG
CTGGTCCTGC CGATCGGCCT GTACATCCGC CTCAACCTGG CCGAGACCTT TTCCGGACGC
GGCCGCCAGG CCAGCACCCG GAACCTGCTC GGCGAGTTGT TCGGCAATCA TCGGCGGGCC
CTGCTGCTCG GCCTGCTGAT CCTCTCCGGA AGCACCATCA CCCAGTACTT CCTCAACTAC
ATGACCACCT TCGCCCTTAC CGAGCTACAC CTGCCGGCGG GCATCGCGAT GCTCTCGACG
CTGGTCGCCG GCGCCGCACT GGCGCTCTCG GCGTTGCTCG GCGGCGTGCT CTGCGACCGC
TACGGGCGCC GCGCCGTGCT GATCCTGCCG CGCCTGGCGC TGCTCGCGGT GCTGTTCCCG
GCACTGCAGG CAATGACCCG TCACCCCGAG CCGGCAGTCT TCCTCGCCGT CCTCGCCCTG
CTCTCGGCCC TGCATGGCAT GAGCGGCGCG GCGCTGATCG TGCTACTGGT AGAAAGCTTC
CCGCGGGCGC TGCGCTCCAC CGGTTTTTCC CTGGTCTATG CGACCGGCGT CGCCGCGTTC
GGCGGCACCG CGCAGATCGT GGTGACCTGG CTGATCGGCG TCACCGGCAA TCCGCTGTCG
CCGCTGGGCT ACCTGCTGCT GGCCAACCTG GTCTGCCTGG TTGGGGCCTG GCTGGCCCGC
GAGACCTGGC CGGGTCGCGG GGACATGGGA GGCGCGCCGC TGGTGCTGCG CGACTGA
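
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. The sketch below shows one way to recompute a GC fraction and run a basic sanity check on a coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Simplified check: ATG start, stop codon at the end, length divisible by 3."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGCGC TGA"
print(gc_fraction(toy), looks_like_complete_cds(toy))
```

Pasting the full gene sequence into `gc_fraction` should give a value comparable to the GC content field of this record, rounded to a whole percent.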
 
Protein sequence
MRSPTRPPAA ADPASRRSVF AVVLGNAVEF FDFGVYATFA VMIGRTFFPS DSAFVSLLLS 
VTAFGVGFVI RPLGAILIGA YADRAGRKPA MLLTLFLMAL GTGGIAVLPG YDSIGPAAPL
LLVLTRLLQG LAWGGEAGPA TTYILEAAPP HKRGTYACWQ VVAQGIAAVA AGLMGYLLTL
WLDERQLQEW GWRIPFAAGL LVLPIGLYIR LNLAETFSGR GRQASTRNLL GELFGNHRRA
LLLGLLILSG STITQYFLNY MTTFALTELH LPAGIAMLST LVAGAALALS ALLGGVLCDR
YGRRAVLILP RLALLAVLFP ALQAMTRHPE PAVFLAVLAL LSALHGMSGA ALIVLLVESF
PRALRSTGFS LVYATGVAAF GGTAQIVVTW LIGVTGNPLS PLGYLLLANL VCLVGAWLAR
ETWPGRGDMG GAPLVLRD