Gene Pfl01_4178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPfl01_4178 
Symbol 
ID3712542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas fluorescens Pf0-1 
KingdomBacteria 
Replicon accessionNC_007492 
Strand
Start bp4714320 
End bp4716563 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content65% 
IMG OID 
ProductDNA internalization-like competence protein ComEC/Rec2 
Protein accessionYP_349906 
Protein GI77460399 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.145225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.646667 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACAG GGATGATGGC GCTGGCGGTC GGTCTGCTGG CTCCGGTTTT TTTACCGGCC 
TTGCCGCCGG TCGGGTTATT GCTGCTGTTG CCGTGGGTGG CGCTGATGCT GCTGCCGTTT
CGCAGCTATC CACTGGCTTT TTTACTGCTT GGATTCACAT GGGCGAGTTT CAACGCGCAG
TTGGCGTTGA ATGATCGACT GCCTTCCCGG CTGGATGGAG AAACACGCTG GGTGGAAGGA
CGGGTCGTCG GCTTGCCGCA GAATGCCGAG GGCGTCGTGC GCTTCGAACT GGCCGATGCG
CGCTCGCGTC ACGAGAAACT GCCGTCGCTG ATGCGTCTGG CGTGGTACGC CGGGCCTGAG
ATCAAAAGTG GCGAACGCTG GCGACTGGCG GTCAAGCTCA AGCGTCCCGG CGGCCTGCTC
AATCCGGATG CGTTCGATTA CGAAGCCTGG CTGCTGGCGC AGCGCATCGG TGCCACCGGG
ACGGTGAAAG ATGGCCAGCG TCTGGCGCCG GCGCAGTGGG CCTGGCGCGA CAGTATTCGC
CAGCGCCTGC TTGCGGTGGA TGCCCAGGGA CGTAACGGCG CCCTGGCTGC GTTGGTCCTC
GGTGATGGTT CGGGTCTCGG CCGCGAAGAC TGGCAGATAT TGCAGGACAC CGGCACCGTG
CATCTGCTGG TGATTTCCGG TCAGCACATC GGGCTGTTGG CTGCGTTGAT GTATGCGCTG
GTCGTCGGGC TGGCGCGGGT CGGGATCTGG CCGTTGCGTT GGCCCTGGCT GCCCTGGGCG
TGTGGTCTGG CGTTCGCGGC GGCGCTGGGT TACGGACTGC TGGCCGGGTT CGACGTGCCG
GTACGGCGAG CCTGCGTGAT GGTTGCGCTG GTGCTGTTGT GGCGTTTGCG CTTCCGCCAC
CTCGGTGCAT GGTGGCCATT GCTGTTGGCG TTCGACGGCG TGTTGCTGAT GGATCCGCTG
GCCAGCCTGC GCCCGGGGTT GTGGCTGTCG TTCGTGGCGG TGGCTGTGTT GATCTTCACC
TTCGGCGGCC GTCTGGGGCC GTGGCGCTGG TGGCAGACCT GGACGCGCGC GCAGTGGCTG
ATCGCGATTG GTTTGTGCCC GGTGCTGCTG GCGCTGAGCC TGCCGATCAG TCTCAGCGGG
CCGCTGGCCA ATCTGCTGGC GGTGCCATGC GTCAGTTTTG CGGTGCTGCC GCCAGCGTTG
CTGGGCACAT TGTTGCTGCC GATCCCTTAT GTCGGGGAAG GGCTGCTGTG GCTGGCGGGC
GGACTGATCG ATGGATTGTT CCGGGCGCTG GCCCTGATTG CCGGGCGGTG GCCGGCGTGG
ATTGCGGCGT CGATACCGGG ATGGGTTCTG GCGCTGGGTT GCATCGGTGC ACTGCTGTTA
TTGCTGCCCC GAGGCGTACC CCTGCGCCTC TTAGGCTGGC CATTGCTGCT GGTGCTGGCA
TTTCCGCCTC GGGAACGCCT GTCCGAAGGC GTGGCCGATG TCTGGCAACT GGACGTCGGT
CAGGGCCTGG CGATTCTGAT CCGCACCCGG CATCACACCT TGCTGTACGA CAGTGGCCCG
CGTTTCGGCG ACTTCGACCT CGGCGAGCGA GTGGTGCTGC CGGCATTGCG CAAACTGGGC
GTGGAACACC TCGACCTGAT GCTGCTCAGC CATGCCGATG CCGATCATGC AGGTGGCGCA
CTCGCGGTGG CGAAAGGATT GCCGGTCAGT CGGGTCATCA GCGGCGATCC GTCGGGGCTA
GCCGAAGCGC TGAACGCCGA GGCTTGTGAA AGTGGTCGGC AATGGCAGTG GGACGGCGTG
GCCTTTCATC TATGGCAGTG GACCGACGCC CATGACAGCA ACCAGCGTTC CTGCGTGTTG
CAGATCGAAG CCAATGGCGA ACGGTTGCTG CTGAGCGGTG ACATCGACAG CGCCGCCGAA
CGGGACCTGC TCAACAGCGC ACTGGCAGTT CATACCCAAT GGCTACAAGC CCCCCACCAT
GGCAGCCGCA GTTCTTCGTC GATGGCCTTG CTCAAGGCTT TGCAGCCACA GGCCGTGCTG
ATTTCCCGAG GGCAGGGCAA TTCGTTCGGC CACCCGCACC CGACGGTCAT CGCCCGCTAC
CGCAAACAGG GCATACAAAT CCATGACAGC GCCGAGCAGG GTGCCATTCA TCTGCAACTG
GGGCGGTTTC AGCCGGCCCG GTCGATGCGT CAACAACGCC GGTTCTGGCG CGACCCGCCG
TTGCCGGGGG CGGAGTACCG TTGA
 
Protein sequence
MRTGMMALAV GLLAPVFLPA LPPVGLLLLL PWVALMLLPF RSYPLAFLLL GFTWASFNAQ 
LALNDRLPSR LDGETRWVEG RVVGLPQNAE GVVRFELADA RSRHEKLPSL MRLAWYAGPE
IKSGERWRLA VKLKRPGGLL NPDAFDYEAW LLAQRIGATG TVKDGQRLAP AQWAWRDSIR
QRLLAVDAQG RNGALAALVL GDGSGLGRED WQILQDTGTV HLLVISGQHI GLLAALMYAL
VVGLARVGIW PLRWPWLPWA CGLAFAAALG YGLLAGFDVP VRRACVMVAL VLLWRLRFRH
LGAWWPLLLA FDGVLLMDPL ASLRPGLWLS FVAVAVLIFT FGGRLGPWRW WQTWTRAQWL
IAIGLCPVLL ALSLPISLSG PLANLLAVPC VSFAVLPPAL LGTLLLPIPY VGEGLLWLAG
GLIDGLFRAL ALIAGRWPAW IAASIPGWVL ALGCIGALLL LLPRGVPLRL LGWPLLLVLA
FPPRERLSEG VADVWQLDVG QGLAILIRTR HHTLLYDSGP RFGDFDLGER VVLPALRKLG
VEHLDLMLLS HADADHAGGA LAVAKGLPVS RVISGDPSGL AEALNAEACE SGRQWQWDGV
AFHLWQWTDA HDSNQRSCVL QIEANGERLL LSGDIDSAAE RDLLNSALAV HTQWLQAPHH
GSRSSSSMAL LKALQPQAVL ISRGQGNSFG HPHPTVIARY RKQGIQIHDS AEQGAIHLQL
GRFQPARSMR QQRRFWRDPP LPGAEYR