Gene PA14_01020 details


Gene Information

Locus tag: PA14_01020
Symbol:
ID: 4383502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 99431
End bp: 100927
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 64%
IMG OID: 639322640
Product: hypothetical protein
Protein accession: YP_788241
Protein GI: 116053804
COG category: [S] Function unknown
COG ID: [COG3517] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family
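
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(99431, 100927)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the two results agree with the listed Gene Length (1497 bp) and Protein Length (498 aa).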


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAAT TGAGCACCGA GAACCTGGCC CAGGGCCAGA CCACCACCGA GCAGACCAGC 
GAGTTCGCCA GCCTGCTGCT GCAGGAGTTC AAGCCCAAGA CCGAGCGCGC CCGCGAAGCG
GTGGAGACCG CCGTGCGGAC CCTCGCCGAG CATGCCCTGG AGCAGACCAG CCTGATCTCC
AACGACGCGA TCAAGTCGAT CGAGTCGATC ATCGCGGCGC TTGACGCCAA GCTCACCGCG
CAGGTCAACC TGATCATGCA CCACGCCGAC TTCCAGCAAC TGGAAAGCGC CTGGCGCGGC
CTGCACTACC TGGTCAACAA CACCGAGACC GACGAGCAAC TGAAGATCCG TGTGCTGAAC
ATCTCCAAGC CGGAGCTGCA CAAGACCCTG AAGAAATTCA AGGGCACCAC CTGGGACCAG
AGCCCGATCT TCAAGAAGCT CTACGAAGAG GAATACGGCC AGTTCGGCGG CGAGCCCTAT
GGCTGCCTGG TCGGCGACTA CTACTTCGAC CAGTCGCCGC CGGACGTCGA GCTGCTCGGC
GAGATGGCGA AGATCTCCGC CGCCATGCAT GCGCCGTTCA TCTCCGCCGC CTCGCCGACG
GTGATGGGCA TGGGTTCCTG GCAGGAACTG TCCAACCCGC GCGACCTGAC CAAGATCTTC
ACCACCCCGG AATACGCCGG TTGGCGTTCG CTGCGCGAGT CCGAGGACTC CCGCTACATC
GGCCTGACCA TGCCGCGCTT CCTGGCGCGC CTGCCCTACG GGGCGAAGAC CGATCCGGTG
GAAGAGTTCG CCTTCGAGGA AGAAACCGAC GGCGCCGACA GCAGCAAGTA CGCCTGGGCC
AACTCGGCCT ACGCGATGGC GGTCAACATC AACCGCTCCT TCAAGCTCTA CGGCTGGTGC
TCGCGGATCC GTGGCGTCGA GTCCGGCGGC GAGGTGCAGG GCCTGCCGGC GCACACCTTC
CCCACCGACG ACGGCGGCGT GGACATGAAG TGCCCGACCG AGATCGCCAT TTCCGACCGC
CGCGAGGCGG AGCTGGCGAA GAACGGCTTC ATGCCGCTGC TGCACAAGAA GAACACCGAC
TTCGCCGCCT TCATCGGCGC GCAGTCGCTG CAGAAACCCG CCGAGTACGA CGATCCGGAC
GCCACCGCCA ACGCCAACTT GGCGGCGCGC CTGCCCTACC TGTTCGCCAC CTGCCGCTTC
GCCCATTACC TGAAGTGCAT CGTTCGCGAC AAGATCGGTT CCTTCAAGGA GAAGGACGAG
ATGCAGCGCT GGCTGCAGGA CTGGATCCTC AACTACGTCG ACGGCGACCC GGCCCACTCC
ACCGAGACCA CCAAGGCCCA GCACCCGCTG GCGGCGGCCG AAGTGGTGGT GGAGGAAGTC
GAAGGCAATC CGGGTTACTA CAACTCGAAG TTCTTCCTTC GCCCGCACTA CCAGCTCGAG
GGACTGACGG TATCGCTACG CCTGGTATCC AAGCTGCCTT CGGCCAAAGA GGCCTGA
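
As a minimal sketch, the GC percentage of a sequence laid out in space-separated blocks like the one above could be computed as follows. The helper name `gc_content` is hypothetical; the result for the full gene sequence can be compared against the GC content reported in the gene information.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)
```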
 
Protein sequence
MAELSTENLA QGQTTTEQTS EFASLLLQEF KPKTERAREA VETAVRTLAE HALEQTSLIS 
NDAIKSIESI IAALDAKLTA QVNLIMHHAD FQQLESAWRG LHYLVNNTET DEQLKIRVLN
ISKPELHKTL KKFKGTTWDQ SPIFKKLYEE EYGQFGGEPY GCLVGDYYFD QSPPDVELLG
EMAKISAAMH APFISAASPT VMGMGSWQEL SNPRDLTKIF TTPEYAGWRS LRESEDSRYI
GLTMPRFLAR LPYGAKTDPV EEFAFEEETD GADSSKYAWA NSAYAMAVNI NRSFKLYGWC
SRIRGVESGG EVQGLPAHTF PTDDGGVDMK CPTEIAISDR REAELAKNGF MPLLHKKNTD
FAAFIGAQSL QKPAEYDDPD ATANANLAAR LPYLFATCRF AHYLKCIVRD KIGSFKEKDE
MQRWLQDWIL NYVDGDPAHS TETTKAQHPL AAAEVVVEEV EGNPGYYNSK FFLRPHYQLE
GLTVSLRLVS KLPSAKEA
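
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own method. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (the two differ only in permitted start codons), and it stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGCCGAAT TGAGCACCGA GAACCTGGCC"))  # MAELSTENLA
```

The printed residues match the start of the protein sequence listed above.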