Gene Pfl01_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPfl01_4067 
Symbol 
ID3714890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas fluorescens Pf0-1 
KingdomBacteria 
Replicon accessionNC_007492 
Strand
Start bp4590575 
End bp4593805 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content56% 
IMG OID 
Productglycosyl transferase 
Protein accessionYP_349795 
Protein GI77460288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0325707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA GTGATTTCTC CTCTGACCTG AAGATTCATG CCAGTTCCAA GGCACCGTCG 
AAACCGAAGG TCACGGTGAT TCTCCCGACC TATTCCCGCG GCCACGGGCC GCTGCAGGAA
TCCATCGACA GTGTATTGGC ACAAAGTTAC CGAAACTTTG AACTGATCAT TGTCGACGAC
GGTTCACGGG ACGGCTCTGC GGCCGTCCTG CAGGAATACC TGAAGAAAGA TCCGCGCATC
ATTGTTCATT CGTACCACAA GAACAGCGGC CTGCCGGCCT TGCGCGTCAA CCAGGCGGCA
CTGCGGGCGA AAGGCAAATA CATTGCCTAC CAGTTCGACG ACGACATGTG GACCGAGCAC
AGCCTGCAAG TGCGTGTCGA GCAGCTGGAA AAGTGCACCA GACCTGCAGT GGCCTATGCC
AACGCCAGCG TCGACATCGC TCTCGCCGAC GGGTCGATCA CTACCCGCAA GCTGGGCGGA
CCGTTCAACT ATGGCCTGTT GATGAACGGC AACTACATTG CCAACAACAC GGTCATGCAC
CACAAGTCCC TGTTTGATAT CGCAGGCATG TATGACTCGC ACGTCATGAT GCGCCGTTAC
TCGGACTATG ATCTGTGGTT GCGATTCGCC AAACACGCCG ACTTTATCTG GGTCGACGAA
GTGACGTCGC ATGTACGGGC CAACCTGGTC GACTCGCTGG GCAAGGAAAT TCCATTGTTC
TTCGCGCGGT ACAGAAAGAG CCTGGCGATA CCACGCGATC ATTTGCTGAC CCCTGCAAAA
ATCAACGACT ACGACGTCAT CGATCTGTCG CAATTTGCCG ATACGTTCAC CGATGCGGAA
ATCATTGAAT ACCGGCGCAT GGAAGCCGCG CCGTTCCTGA CCAAGTTCAA TGATTACTGC
AGCGACTCCG AAATGGCGGT CGCAGCGGCC GCTCGAGGAC GCAAGCTGCA CTTGTTGACC
GTGAAACCGG ATTATTCGAC ATCGGTGGAT GTCACGATCC ACAACTTCAC GCAACTGGCA
GAACAGCGGG CGATCACCAG CACTTTCGTC AAGGAAAACG ACCTGCCGGT CATCGATCTG
GCGGGCGTGG ATGTGACCGT GCTCTATCGC ACTGTCGGCG TTGCCGGCAG CCAGTTCGTC
AAGAACCAGA AAGGCAATAT GCCTTTGGCC TACTTGATGG ACGACAACAT GCTGCACTTC
CATGAAGTCG GTCCCCAACA CAGCTTTCTT GCGCCTGGCA CACCGACCTA TCAGAACATC
GCGCAACAGA TCCAGTCGGT CGACACCTGC ATCGGTTACA GCGATGCAAT CAATGAAGAC
CTGCGCGAGC TCAACAGTAA AACAGTTCGG CTCAATACCA ACATTCATGC CCGGTTCGTG
CAAAAACGCA CCTATAGCCG CGGCAAGCGT CTGAAAGTTG CAATCATGTC CGGTGCCGTG
CGCGAGGATA TTCTGCGCGA GCTCTGGCAA GCACTTGCGA ACTTCGCCAG CGCCCACGCC
AATTCGGTAG AGTTTCACTT CTGGGGTCTG GATCCGGAGA AATTCGGCAC ACTGGAATGT
CCGGTCTTCT TCAAACCGTT CACTCACGTC TACGAAAGCT ATATTCGCGA CCTGAGCGAA
ACCAGCTTCG ACATCGCACT GGTTCCCCTG GACTTCAGCA CCCGCGCCGC GCGCAGCAAA
AGCCCGGTCA AATTGCTGGA ATCAGTCGCC GCCGGCGCCA TCGGCATCTT CACCGATGCC
GTACCCTACT CCGACATTCC AGATGCCTGT TGTGTGAAGG TCGAAAACAC GGTCGAGGCC
TGGGAACAGG CGCTGAATCA TTGCTACGAA ATGGGCCAGC AAAAGCGTGA CCAGATGCTG
GAAAACGCCC GGGAACTGGT GCTTTCGCGC TATACCACCG AATCGCAGTT CTACGACTTC
CTTGCCAGCT GCGAAGCCGT TCGCCTGCAT TCCAGACTGG GTGACAAAGC CATTGCCTAT
GCCTTCCACG AAACAGCGCT GGGCGGCGCC ACGCTGCACC TGATCCGCCA CGCCAGCCTC
GTGGCTTCGC TGGGTTTCCG CGTTGTCGGC ATCGTTCCGC AGGATGCCAC CTACGGCCCG
ATCTTCAAGG CGCGCTGGGA CGCCGCAACG AATGGGGCGT GCCTGCTCGA AGAGCAATGG
CCTTCGGGGT ATATCGACAG TCCGCAACCG CAGCGTCCGT TTCAGCCGCA GGATGAGGCG
GCAGCAAGGC ATCTGTCCAA TTGCCTCGAG CCGGAAAGAG TCGGCTTCCT GCACTTTGCG
ACATGGTCAC CGACCATGAG CCTGCTGGCC AAACAACTGG GCATTCCCTG CAGCGCCAGT
GTCCACCAGT TTTATGAGGG CGCCGGCAAC TCCATCGTTC ACTTTGCCGA CTCGATCCAC
TGCTCCTCGC TGACTCATGG CTTCAAATGG TCGAGCCTGT CGAAAAGTCC GGCCCGGCGT
ATCGTCTGCC CGGTCGCCGA CGATTACTTC GCCAGTTTCG CTACCAATCG CGCCCGTGCA
GCCAAGAGCG GCAATGCCTT GCGAATTCTT GTCTCCGGCA CGCTGCAGCC GCGAAAGAAC
CAACTCGGCG CAATTCAGGC GGCAATCCTG CTCAACGATG CCGGCTTCGA TGTTTCGCTG
GACCTGATCG GCTACACGGT GTTTCACAAT CAATACCTGG CCGAGTGCAC TTCATTAATT
GAAGCCAGCC AATATAAAGA TAAATTCATC ATTCATGGTT TTATCGATGA TCCAAAACCG
TTTTATGATA AGTGCGACTT GTTGTTGATA TCGGCTACCG ATGAGTCCAT GCCTCAAACC
ATGTTGCAGG CAATGGCCAT GGGTATACCC GTTGTTTCAA CAATTGTCGG CGGCGTAGGG
GAAATCATCA AACATCGCTA CAGCGGCTTC CTCGCCCAGG ACGACAGTCC AGAGGCCATG
GCAGCGGCAG TATCGCAGTA CATCCGGTTG TCACAATTGC AGCGTCTGGA AATCATCGAT
CGTGCGCAGC GATCCATGAA ATTCCTTGCC CGCCCGACCT ACGTCCGCTC CGAACTGGTG
GATCTGTACA ACCAGGCATT CGAGGAGTTC GCCCGGCACC GCAAAGCTGC ACAAAAGCCC
GGCGACAACT CGTCCAGGAC CTCGGCGTCT TCGTATGAAC AGATGCTGCT TGCAACTCTG
AACACCACCC GCAGTCAGAT CAGCCAATTA AGCAGAGCGC TGGATAAATG A
 
Protein sequence
MKFSDFSSDL KIHASSKAPS KPKVTVILPT YSRGHGPLQE SIDSVLAQSY RNFELIIVDD 
GSRDGSAAVL QEYLKKDPRI IVHSYHKNSG LPALRVNQAA LRAKGKYIAY QFDDDMWTEH
SLQVRVEQLE KCTRPAVAYA NASVDIALAD GSITTRKLGG PFNYGLLMNG NYIANNTVMH
HKSLFDIAGM YDSHVMMRRY SDYDLWLRFA KHADFIWVDE VTSHVRANLV DSLGKEIPLF
FARYRKSLAI PRDHLLTPAK INDYDVIDLS QFADTFTDAE IIEYRRMEAA PFLTKFNDYC
SDSEMAVAAA ARGRKLHLLT VKPDYSTSVD VTIHNFTQLA EQRAITSTFV KENDLPVIDL
AGVDVTVLYR TVGVAGSQFV KNQKGNMPLA YLMDDNMLHF HEVGPQHSFL APGTPTYQNI
AQQIQSVDTC IGYSDAINED LRELNSKTVR LNTNIHARFV QKRTYSRGKR LKVAIMSGAV
REDILRELWQ ALANFASAHA NSVEFHFWGL DPEKFGTLEC PVFFKPFTHV YESYIRDLSE
TSFDIALVPL DFSTRAARSK SPVKLLESVA AGAIGIFTDA VPYSDIPDAC CVKVENTVEA
WEQALNHCYE MGQQKRDQML ENARELVLSR YTTESQFYDF LASCEAVRLH SRLGDKAIAY
AFHETALGGA TLHLIRHASL VASLGFRVVG IVPQDATYGP IFKARWDAAT NGACLLEEQW
PSGYIDSPQP QRPFQPQDEA AARHLSNCLE PERVGFLHFA TWSPTMSLLA KQLGIPCSAS
VHQFYEGAGN SIVHFADSIH CSSLTHGFKW SSLSKSPARR IVCPVADDYF ASFATNRARA
AKSGNALRIL VSGTLQPRKN QLGAIQAAIL LNDAGFDVSL DLIGYTVFHN QYLAECTSLI
EASQYKDKFI IHGFIDDPKP FYDKCDLLLI SATDESMPQT MLQAMAMGIP VVSTIVGGVG
EIIKHRYSGF LAQDDSPEAM AAAVSQYIRL SQLQRLEIID RAQRSMKFLA RPTYVRSELV
DLYNQAFEEF ARHRKAAQKP GDNSSRTSAS SYEQMLLATL NTTRSQISQL SRALDK