Gene Pfl01_3149 details

Gene Information

Locus tag: Pfl01_3149
Symbol: dnaE2
ID: 3712874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas fluorescens Pf0-1
Kingdom: Bacteria
Replicon accession: NC_007492
Strand:
Start bp: 3614981
End bp: 3618058
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: error-prone DNA polymerase
Protein accession: YP_348878
Protein GI: 77459371
COG category:
COG ID:
TIGRFAM ID:
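
The length fields in this record follow from its coordinates. As a quick consistency check, the arithmetic can be sketched as below. This assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3614981, 3618058)
print(length, protein_length(length))  # 3078 1025
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.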


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGG GTTATGCCGA GCTGCACTGC CTGTCGAATT TCAGTTTCCA GCGCGGCGCG 
TCCAGTGCGC TGGAACTGTT TCAGCGCGCG AAAAAGCACG GCTATCAGGC GCTGGCGATT
ACCGATGAAT GCACCCTCGC CGGGATCGTC CGTGCCTGGC AGGCAGCCAA GTCCGTGGAG
TTGCCGCTGA TCATCGGCAG CGAAATCCGT ATCGAGAACG GCCCGAAACT GGTGTTGCTG
GTGGAGAACA TCGAGGGTTA TCAGGCCCTG TGCGGCTTGA TCACCCAGGC TCGACGGCGT
ACCCAAAAAG GCCAGTATCA AATACTGCGC GAAGATTTCA GCGAGCCGTT GCCGGGATTG
CTGGTGCTGT GGGTGCCGGA GGCGGTCGAT GAGGTGGAAG AGGGGCGCTG GCTGAAACAG
ACGTTCGGCG AGCGGCTGTG GCTGGCGGTG CAGTTGCATC GTGGGCAGAA CGATCAGCAA
CGGCTTGCCG CACTGTTGAG TCTGGCGGAT GAGCTGCAAA TTCCAGCCGT GGCCAGCGGC
GATGTGCACA TGCATGCCCG TGGCCGGCGA GCCTTGCAGG ACACCATGAC CGCGATCCGC
CATCACGTCC CGGTGGCCGA GGCCGGATTG CGCCTGCACC CCAACGGCGA GCGGCATTTG
CGCAGCCTCG ATGTATTGCG CGAGCTGTAT CCACAGACCT TGCTGGACGA ATCGCTAAAG
CTGGCCCGCC GCTGCACCTT CGATCTGGGC GAGTTGCGCT ATCAATACCC CAAAGAGCTG
GTGCCCGAGG AACACAGCGC CAGTTCCTGG CTGCGCCACC TGACCGAACA AGGCATCGCC
TGGCGCTGGC CGAAAGGCGC ACAGCCCAAG GTGCTCAAGC AGATCGACGA TGAGCTGGAG
CTGATCGCCG AACTCGGCTA CGAAAGCTAC TTCCTCACCG TGCACGACGT GGTGCGCTTC
GCCCGTGAGC AGAAAATTCT CTGTCAGGGT CGGGGTTCGG CGGCCAACTC GGCTGTGTGT
TTTGCCCTGG GCATCACCGA GATCGACCCG GACCGCACCA CGCTGCTGTT CGAGCGTTTC
ATGTCGAGGG AGCGCAACGA GCCGCCGGAC ATCGACGTCG ATTTCGAGCA TGAACGTCGC
GAAGAGGTCC TGCAATACGT GTTTCGTCGT TATGGTCGCC GTCGTGCCGC GCTGACCGCG
GTGGTCAGCA CCTACCACGC TTCCGGCGCG ATCCGCGATG TGGCCAAGGC GCTGGGCTTG
CCGCCGGACC AGATCAATGC GCTGGCCGAC TGTTGCGGTC ACTGGAGCGA TGAAACCCCA
CCCGTGGAGC GCCTGCGAGA GGGCGGTTTC GACCCCGAAA GTCCGCTGCT GCACCGGGTG
CTGAGCCTGA CCGGGCAACT GATCGGCTTC CCCCGGCACC TGTCGCAGCA CCCCGGTGGT
TTCGTGATAT CCGAGCAACC GCTGGACACG CTGGTGCCGG TGGAAAACGC CGCCATGGCC
GACCGCACGA TCATCCAGTG GGACAAGGAC GATCTCGACG CCGTCGGCCT GCTCAAGGTG
GATATCCTCG CCCTCGGCAT GCTCAGCGCG ATCCGCCGCT GTTTCGACCT GCTGCGCCGC
CATCGCCATC AGGACCTGAG CCTGGCCACG ATCCCGCCTG AGGATCGCCC GACCTACGAC
ATGATCAGCC GCGCCGATAC CATCGGCGTG TTCCAGATCG AGTCCCGGGC GCAGATGTCG
ATGCTGCCGC GTCTGCGCCC GCAAACCTTC TACGACCTGG TGATCGAGGT GGCCATCGTC
CGGCCGGGGC CGATACAGGG CGGGATGGTC CATCCGTACC TGCGCCGGCG GAACAAGGAA
GAAGAGGAAA CCTATCCGTC CCCGGAACTG GAAGTGGTGC TCAAGCGCAC CCTCGGTGTG
CCGCTGTTTC AGGAACAGGT GATGCAGATC GCGATTGTCG CCGCCGACTA CAGCCCCGGT
GAGGCCGACC AGTTGCGCCG CTCCATGGCC GCGTGGAAGC GCCACGGCGG GCTGGAGCCG
CACAAGGAGC GATTGGCCGC CGGCATGAAG AAAAACGGCT ACAGCCCGGA ATTCGCCGCG
CAGATCTTCG AGCAGATCAA GGGCTTCGGC AGTTACGGAT TCCCTGAATC CCACGCTGCC
AGTTTTGCCT TGCTGACCTA TGCCAGTTGC TGGCTCAAGT GCCACGAACC GGCGGCGTTC
GCCTGTGCGC TGATCAACAG CTGGCCGATG GGTTTCTACA GCCCGGATCA GATTCTTCAG
GATGCGCGCC GGCATCATTT GCAGATCCGC CCGGTGGATG TACGCGCCAG TGACTGGGAT
TGCAGCCTGG AGCCGATTGC CGGCGAGCAA CCGGCCATCC GCATGGGCCT GCGGATGATC
AAGGGCTTTC GAGAGGAGGA TGCCCGGAGC ATTGAAAAGG CTCGGGCGAG GGGGGCGTTT
GCCGATGTCG CCGATCTGGG CGAACGGGCC GGGCTCGACA GTCGCGCCCA GGCGTTGCTG
GCGGATGCCG GGGCGTTGCG CGGCCTGGCC GGTCACCGCC ATCGGGCACG CTGGGAAGTG
GCCGGGGTGC AGAAACAGCT CGGGCTGTTT GCCGGGTTGC CGAGTCAGGA GGAACCGGAT
GTGTTGCTGC CGACGCCGAG TGTCAGCGAA GACCTGTTCA CCGACTACGC CACCCTTGGC
ACCACGCTGG GGCCGCACCC GCTGACGCTG TTGCGCAACG AGTTGCGGGC GCGGCGCTGC
CGCAGTTCCC GGGACTTACT GGAGGTCGAA CACGGCCGGC CGGTCAGCGT GGCGGGGCTG
GTGACCGGGC GTCAGCGGCC GGGCACTGCC AGTGGCGTGA CCTTCGTGAC CCTGGAAGAC
GAGTTCGGCA ACGTCAACGT GGTGGTCTGG CGCGATCTGG CCGAGCGTCA GCGGCAGGTG
CTGGTGGGCT CGCAATTGCT CAAGGTCGAT GGCCGCTGGG AGCGGGAAGG CGAGGTGCGG
CACCTGATCG CCGGACGCTT GAGCGATCTG ACTCCGCTGC TCAACGGCAT CCGCGTACAG
AGCCGCGACT TCCACTGA
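
The gene sequence above is laid out in blocks of ten bases, so the whitespace has to be removed before any computation. A minimal sketch of how a GC content figure such as the one reported in this record could be recomputed is shown below. The function name is hypothetical, and the example input is a toy string in the same blocked layout, not data from the record.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a whitespace-blocked DNA sequence."""
    # Join the ten-base blocks and lines into one continuous string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input only; paste the full gene sequence to check the reported value.
print(gc_fraction("ATGC GGCC"))  # 0.75
```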
 
Protein sequence
MNQGYAELHC LSNFSFQRGA SSALELFQRA KKHGYQALAI TDECTLAGIV RAWQAAKSVE 
LPLIIGSEIR IENGPKLVLL VENIEGYQAL CGLITQARRR TQKGQYQILR EDFSEPLPGL
LVLWVPEAVD EVEEGRWLKQ TFGERLWLAV QLHRGQNDQQ RLAALLSLAD ELQIPAVASG
DVHMHARGRR ALQDTMTAIR HHVPVAEAGL RLHPNGERHL RSLDVLRELY PQTLLDESLK
LARRCTFDLG ELRYQYPKEL VPEEHSASSW LRHLTEQGIA WRWPKGAQPK VLKQIDDELE
LIAELGYESY FLTVHDVVRF AREQKILCQG RGSAANSAVC FALGITEIDP DRTTLLFERF
MSRERNEPPD IDVDFEHERR EEVLQYVFRR YGRRRAALTA VVSTYHASGA IRDVAKALGL
PPDQINALAD CCGHWSDETP PVERLREGGF DPESPLLHRV LSLTGQLIGF PRHLSQHPGG
FVISEQPLDT LVPVENAAMA DRTIIQWDKD DLDAVGLLKV DILALGMLSA IRRCFDLLRR
HRHQDLSLAT IPPEDRPTYD MISRADTIGV FQIESRAQMS MLPRLRPQTF YDLVIEVAIV
RPGPIQGGMV HPYLRRRNKE EEETYPSPEL EVVLKRTLGV PLFQEQVMQI AIVAADYSPG
EADQLRRSMA AWKRHGGLEP HKERLAAGMK KNGYSPEFAA QIFEQIKGFG SYGFPESHAA
SFALLTYASC WLKCHEPAAF ACALINSWPM GFYSPDQILQ DARRHHLQIR PVDVRASDWD
CSLEPIAGEQ PAIRMGLRMI KGFREEDARS IEKARARGAF ADVADLGERA GLDSRAQALL
ADAGALRGLA GHRHRARWEV AGVQKQLGLF AGLPSQEEPD VLLPTPSVSE DLFTDYATLG
TTLGPHPLTL LRNELRARRC RSSRDLLEVE HGRPVSVAGL VTGRQRPGTA SGVTFVTLED
EFGNVNVVVW RDLAERQRQV LVGSQLLKVD GRWEREGEVR HLIAGRLSDL TPLLNGIRVQ
SRDFH
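
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship using the compact codon ordering (T, C, A, G) in which NCBI publishes its genetic code tables. It is a simplified illustration, not the pipeline used to produce this record: it stops at the first stop codon and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a whitespace-blocked DNA sequence up to the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAACCAGG GTTATGCCGA GCTGCACTGC"))  # MNQGYAELHC
print(translate("AGCCGCGACT TCCACTGA"))  # SRDFH
```

The two outputs match the first ten and last five residues of the protein sequence listed above.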