Gene Pfl01_5240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPfl01_5240 
Symbol 
ID3716191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas fluorescens Pf0-1 
KingdomBacteria 
Replicon accessionNC_007492 
Strand
Start bp5895992 
End bp5897170 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content60% 
IMG OID 
Productglycine betaine/L-proline transport ATP-binding subunit 
Protein accessionYP_350968 
Protein GI77461461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0848067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.021429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATAA TTCGCTTCGA TAACGTTGAC GTGATCTTCA CCAAGGATCC GCGCGAAGCA 
CTGAAACTTC TCGATCAAGG CCTGACCCGC AGTGAAATCC TGAAAAAGAC CGGGCAGATC
GTCGGCGTTG AAAAGGCCAG CCTGGACATC AACAAAGGCG AGATCTGCGT GCTGATGGGC
CTCTCCGGCT CCGGCAAGTC GAGCCTGCTG CGCTGCATCA ACGGCCTCAA CACCGTGAGC
CGCGGCAAGC TGTTCGTCGA ACATGAAGGC AAGCAGATCG ACATCGCCTC CTGCTCCCCG
GCCGAGCTGA AAATGATGCG CACCAAACGC ATCGCCATGG TGTTCCAGAA GTTCGCCCTG
ATGCCCTGGC TGACGGTGCG CGAGAACATC AGTTTCGGTC TGGAAATGCA GGGTCGTCCG
GAGAAGGAAC GGCGCAAACT GGTCGATGAC AAACTCGAAC TGGTGGGCCT GACCCAATGG
CGCAACAAGA AGCCCGACGA GCTGTCCGGC GGCATGCAGC AGCGTGTCGG CCTGGCCCGC
GCGCTGGCGA TGGACGCCGA CATTCTGCTG ATGGACGAAC CGTTCTCGGC CCTCGACCCG
CTGATCCGTC AGGGCCTGCA GGATGAACTG CTGGAACTGC AACGCAAGCT GAGCAAGACC
ATCGTGTTCG TGAGCCACGA CCTCGACGAG GCGCTGAAAC TCGGCAGCCG CATCGCGATC
ATGAAAGACG GCCGGATCAT CCAGTACAGC GTGCCGGAAG AGATCGTGCT CAATCCTGCG
GACGATTACG TGCGCACCTT CGTCGCCCAC ACCAACCCGC TGAACGTGCT GTGCGGTCGC
AGCCTGATGC GCACCCTGGA CAACTGCAAA CGCATCAACG GTTCGGTATG TCTGGATCCG
GGCGGCGATT CGTGGCTGGA CCTGGCCGAA GGCAACACCA TCAAGGGTGC GCGGCAGAAC
GGTTCGGTGC TGAACCTGCA GAACTGGGCA CCGGGGCAAG CCGTGGAAGG GCTGGAGCGC
AAACCGACGC TGGTGGACTC GAACATCGGC ATGCGCGACG CGTTGCAGAT CCGATACCAG
ACCGGCAACA AACTGGTGCT GCACGACAAC AACCATGTGG TGGGGATTCT TGGGGACAGC
GAGCTGTATC ACGCGTTGCT CGGGAAGAAC CTAGGGTAA
 
Protein sequence
MSIIRFDNVD VIFTKDPREA LKLLDQGLTR SEILKKTGQI VGVEKASLDI NKGEICVLMG 
LSGSGKSSLL RCINGLNTVS RGKLFVEHEG KQIDIASCSP AELKMMRTKR IAMVFQKFAL
MPWLTVRENI SFGLEMQGRP EKERRKLVDD KLELVGLTQW RNKKPDELSG GMQQRVGLAR
ALAMDADILL MDEPFSALDP LIRQGLQDEL LELQRKLSKT IVFVSHDLDE ALKLGSRIAI
MKDGRIIQYS VPEEIVLNPA DDYVRTFVAH TNPLNVLCGR SLMRTLDNCK RINGSVCLDP
GGDSWLDLAE GNTIKGARQN GSVLNLQNWA PGQAVEGLER KPTLVDSNIG MRDALQIRYQ
TGNKLVLHDN NHVVGILGDS ELYHALLGKN LG