Gene Pfl01_5040 details

Gene Information

Locus tag: Pfl01_5040
Symbol:
ID: 3713696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas fluorescens Pf0-1
Kingdom: Bacteria
Replicon accession: NC_007492
Strand:
Start bp: 5676977
End bp: 5678083
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: hypothetical protein
Protein accession: YP_350768
Protein GI: 77461261
COG category:
COG ID:
TIGRFAM ID:
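
The length fields in this record can be cross-checked against the coordinates. The short Python sketch below is only an illustration, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5676977
end_bp = 5678083

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.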


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.763439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00496845
Fosmid hitchhiking: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACCCCGA CATCCCGCAG GAAAGTGGTC GTCGCCCATT CGGTTCGGCC CGGCGCACCA 
CAACATGAAG TGCAGACCAA CAAGGCCCTC GCGCGATGGC TGGCGCAGAT CCTCGGCCTC
AAGTTCGGCG GCAGTTACGA CGCCGAAAAG CATCGCGGCC GGGATATTTA TCTGCTGCCG
ACCCAGACCC TCGTCGGCGC GGCGGCCCGC GAACTGGGGG TGAGCGGCCC GGACGATTTG
TGGGGCGGTT TCGTCGAGCA CGATTTCATC TGCACCAAAG CCATCAGCCA CGGTTTGCGC
AGCCATCAGG CCCATGCGCC GCAAGGCTGG TCGCCCTTGT TTTCCGAGCG GGTGCGCACC
GTGGTGCTGG ACGGGCTGAG TGTTTTTGCG CTGGAGGATG CACGGCCCGC CGCCGAACAT
CTGTTGTACA GCGGGCCGAT CCGGATCAAG CCGATTCACG CCTGTGCCGG GCGGGGGCAG
GAAGTGATCA AGAGCCTGGA TGCGTTCGAC GAAATCCTCG CCCGACCCGA GGCTAGAGAA
TTGTTCAGCG ATGGCGTGGT GCTGGAGCAG GATTTGAGTC AGGTAGTTAC CCACAGCGTC
GGCCAGTCGT TCATCGGCGG CAGGGTGCTG AGTTACTGCG GTGATCAATA CTTGACCAAG
GACGCCCACG GCGAAGAGGT GTACGGCGGC TCGAACCTGC TGGTGGTGCA GGGCGGTTAC
GAGGATCTGC TGGCGCTGGA TCTGCCCGAC GACGTGCGTC TGGCGATCCA GCAGGCGCAG
GTGTTCGACC GGGCGGCGGA CGAGGCCTAT CCGCGTTTCT ACGCCTCGCG GCGCAATTAC
GACATCGCCC AGGGCCTGGA CAGCGAAGGC CGGCCGCGCA GTGGCGTGCT CGAGCAGTCC
TGGCGCATGG GCGGCGCCAG CAGCGCGGAA GTGGCGGCGC TGCAAAGTTT CGTCAACGAT
CCTTCGATGC GCGCGATCCG CGTGTCGTCG GTGGAAACCT ATACCGATCA GGCCCTGCCG
GCGGATGCCA TCGAGGTGTA TCGCGGGCCG GCCGAGAACA GCGACTTTCT CCTCAAATAC
GTAACGGTCA AATCCTATGA CGGCTAG
 
Protein sequence
MTPTSRRKVV VAHSVRPGAP QHEVQTNKAL ARWLAQILGL KFGGSYDAEK HRGRDIYLLP 
TQTLVGAAAR ELGVSGPDDL WGGFVEHDFI CTKAISHGLR SHQAHAPQGW SPLFSERVRT
VVLDGLSVFA LEDARPAAEH LLYSGPIRIK PIHACAGRGQ EVIKSLDAFD EILARPEARE
LFSDGVVLEQ DLSQVVTHSV GQSFIGGRVL SYCGDQYLTK DAHGEEVYGG SNLLVVQGGY
EDLLALDLPD DVRLAIQQAQ VFDRAADEAY PRFYASRRNY DIAQGLDSEG RPRSGVLEQS
WRMGGASSAE VAALQSFVND PSMRAIRVSS VETYTDQALP ADAIEVYRGP AENSDFLLKY
VTVKSYDG