Gene Information |
Locus tag | PA14_18510 |
Symbol | algE |
ID | 4381508 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | - |
Start bp | 1588212 |
End bp | 1589684 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639324033 |
Product | alginate production outer membrane protein AlgE |
Protein accession | YP_789621 |
Protein GI | 116051542 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00972901 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACAGCT CCCGTTCCGT CAACCCGCGG CCGTCCTTCG CGCCGCGCGC CCTGTCCCTG GCCATCGCCC TGCTGCTCGG CGCGCCGGCG TTCGCCGCCA ACAGCGGCGA GGCGCCGAAG AACTTCGGCC TGGACGTGAA GATCACCGGC GAATCGGAAA ACGATCGCGA CCTCGGCACC GCTCCCGGCG GCACCCTCAA CGACATCGGT ATCGACCTGC GGCCCTGGGC CTTCGGCCAG TGGGGCGACT GGAGCGCCTA CTTCATGGGC CAGGCGGTGG CCGCCACCGA CACCATCGAG ACCGACACCC TGCAATCGGA CACCGACGAC GGCAACAACA GCCGCAACGA CGGTCGCGAG CCGGACAAGA GCTACCTCGC CGCACGCGAA TTCTGGGTCG ACTACGCCGG CCTCACCGCC TACCCCGGCG AGCACCTGCG CTTCGGCCGC CAGCGCCTGC GGGAAGACAG CGGCCAGTGG CAGGACACCA ACATCGAGGC GCTGAACTGG AGCTTCGAGA CCACCCTGCT CAACGCCCAT GCCGGGGTCG CCCAGCGTTT CAACGAATAC CGCACCGACC TCGACGAACT GGCTCCGGAG GACAAGGACC GCACCCATGT GTTCGGCGAC ATCTCCACCC AGTGGGCGCC GCACCACCGC ATCGGCGTGC GCATCCACCA CGCCGACGAC AGCGGCCACC TGCGCCGCCC CGGCGAGGAA GTCGACAACC TCGACAAGAC CTATACCGGC CAGCTCACCT GGCTCGGCAT CGAGGCTACC GGCGATGCCT ACAACTATCG TTCGAGCATG CCGCTGAACT ACTGGGCCAG CGCCACCTGG CTGACCGGCG ACCGCGACAA CCTGACCACC ACCACGGTCG ACGACCGGCG CATCGCCACC GGCAAGCAGA GCGGCGACGT CAATGCCTTC GGCGTCGACC TCGGCCTGCG CTGGAACATC GACGAGCAAT GGAAGGCCGG CGTCGGCTAC GCCCGCGGCA GCGGCGGTGG CAAGGACGGC GAGGAGCAGT TCCAGCAGAC CGGGCTGGAG AGCAATCGCT CCAACTTCAC CGGCACCCGC TCGCGCGTGC ACCGCTTCGG CGAAGCCTTC CGCGGCGAAC TGAGCAACCT CCAGGCAGCC ACCCTGTTCG GCTCCTGGCA ACTGCGCGAG GACTACGACG CGAGCCTGGT CTACCACAAG TTCTGGCGCG TCGACGACGA CTCCGACATC GGCACCAGCG GCATCAACGC CGCCCTGCAA CCGGGCGAGA AGGACATCGG CCAGGAACTC GACCTGGTGG TGACCAAGTA CTTCAAGCAA GGCCTGCTGC CGGCCTCGAT GAGCCAGTAC GTCGACGAGC CCTCGGCGCT GATCCGCTTC CGCGGCGGCC TGTTCAAGCC GGGCGACGCC TACGGGCCGG GCACCGACTC GACCATGCAC CGCGCCTTCG TCGACTTCAT CTGGCGCTTC TGA
Protein sequence | MNSSRSVNPR PSFAPRALSL AIALLLGAPA FAANSGEAPK NFGLDVKITG ESENDRDLGT APGGTLNDIG IDLRPWAFGQ WGDWSAYFMG QAVAATDTIE TDTLQSDTDD GNNSRNDGRE PDKSYLAARE FWVDYAGLTA YPGEHLRFGR QRLREDSGQW QDTNIEALNW SFETTLLNAH AGVAQRFNEY RTDLDELAPE DKDRTHVFGD ISTQWAPHHR IGVRIHHADD SGHLRRPGEE VDNLDKTYTG QLTWLGIEAT GDAYNYRSSM PLNYWASATW LTGDRDNLTT TTVDDRRIAT GKQSGDVNAF GVDLGLRWNI DEQWKAGVGY ARGSGGGKDG EEQFQQTGLE SNRSNFTGTR SRVHRFGEAF RGELSNLQAA TLFGSWQLRE DYDASLVYHK FWRVDDDSDI GTSGINAALQ PGEKDIGQEL DLVVTKYFKQ GLLPASMSQY VDEPSALIRF RGGLFKPGDA YGPGTDSTMH RAFVDFIWRF
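
The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TGA). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1588212, 1589684)
print(length)                  # 1473
print(protein_length(length))  # 490
```

Passing the Gene sequence field to `gc_percent` (the spaces between 10-base blocks are stripped) gives a value that can be compared against the GC content field.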