Gene PSPTO_4101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPSPTO_4101 
SymbolhopAK1 
ID1185781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas syringae pv. tomato str. DC3000 
KingdomBacteria 
Replicon accessionNC_004578 
Strand
Start bp4621223 
End bp4622890 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content57% 
IMG OID637395447 
Producttype III helper protein HopAK1 
Protein accessionNP_793862 
Protein GI28871243 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3866] Pectate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGA TCAACAGAAA CATCTACCCC GTCTCCGGGA TTTCTGCGCA GGATGCCCCT 
GTACAAACTG ATCAGCTCCA GCCGCAAGGC CAGGGCATCA GGCCGGGGCA CAATAGCAAC
CTGATCGACT TCGGACTGAT ACAGCAGGCC AATGGTCCGC ACTCATCGCT GAACACATCG
AGCTCCAGAA TTCAGCCGAC TGACACCAGC ACATCCTCAA ACAGGCTGGG GGGTAATGGC
GATCAGTTAC TGAACAAACT CGTGGAAGCG ATCCGTAATA TCCTCAACAA CCTGCTCTCT
CTGCTGGAAG GCAATCAACA CCAGGGCTCT TCGCCTGCAC AGACCCAGCG TGAACAGACG
CCGACGTCCA CTCAATCGCA CGCTTCGCCT TCCTCGTCGT CTTCATCTTC GCCGTCGACA
TCCTCCCAGT CTTCACCCTC AGTGCCTTCA ACGCCTCAGG GCAACGCAGA AAAACCGTTT
GTGGTGCAGA GCGATCATCC GGCGGAAAAA CCGGTATCGC TGCAGAGAAC CTCAGAGCCA
ACGTCTGTGA CGCCGCCACA AACACCACCG CAGGCTGTCG AGCGAAACAG CATTACCCCG
GACAAGGCAC CGGCCAAACC CGAAGCGGTA AAGCCGGCAG TGGTCAACGA CCCGGTGCTG
CCGAAAACCT CGATCCCTGC CGCCGCCAAG CCTGACAGCA CGGTGACCGC CGCAAAACAC
GCGACGCCCG CTGCCCGTGG CCAGGGCGCT GACATGTCCG GCATGATCGG TTTTGCCAAG
GAAGCCAATA CCACCGGGGG CAACAACGGC GAAGTGGTCA CCGTGAACAC GGTTGCCGAC
CTCAAGAAGT ACATGGAGGA CGACAAAGCC CGCACCGTCA AGCTGGGGGC CAACCTGTCT
GCCGACAGTA AAGTGTCGAT AAATTTCGGG GCCAACAAAA CCCTGCTGGG CACCGATAAA
GGCAACACCC TGCACAACAT CTATCTGGCC AGCGGCAAGA CCGCCAGCAA CGACATTTTC
CAGAATCTGA ACTTCAACCA CGACGCCCGT TACCGTGAAA ACGGCGACAT GCAGATGTTC
ATCAGCAGCG GTCAGAAATA CTGGATCGAC CACATCACCG CTACCGGAAC CAAGGATCAG
AACCCCAAAG GTCTGGATAA ACTGCTCTAC GTGGGCGGCA AGGCAGATAA CGTCAGCCTG
ACCAATTCGA AATTCCAGAA CAACGAGTAT GGCGTGATTC TCGGTCAGCC GGACGACTCG
GCAGCCGCCA AAGCCGAGTA CAAGGGCTAC CCACGGATGA CAATCGCCAA CAACGTGTTC
AGCAACCTCG ATGTCCGCGG GCCCGGTCTG TTTCGTCAGG GCCAATTTGA CGTAGTTAAC
AACTCGATCG ACAAATTCCA CCTCGGTTTC ACTGCGACCG GGAACGCTAC CATCCTGTCG
CAGGCCAACT ATTTCAGCAA CGGTGTCGAT GTTTCCAACA AGGCAAGTAA TAGCGGCGTG
CTGGATGACT ACGGCGATGC GCACTTCAAA GACATCGGCA GTAACGTCAG TTTCACTCAG
AAATCGCCGG TTACCGCCTG GACACCGAGC TACAACCGGG ACGTGAAAAC AGCCGAAGCA
GCCAGAGCCT ATGACCTGGC CAATGCGGGT GCACAGGTCG TGAAATAA
 
Protein sequence
MNTINRNIYP VSGISAQDAP VQTDQLQPQG QGIRPGHNSN LIDFGLIQQA NGPHSSLNTS 
SSRIQPTDTS TSSNRLGGNG DQLLNKLVEA IRNILNNLLS LLEGNQHQGS SPAQTQREQT
PTSTQSHASP SSSSSSSPST SSQSSPSVPS TPQGNAEKPF VVQSDHPAEK PVSLQRTSEP
TSVTPPQTPP QAVERNSITP DKAPAKPEAV KPAVVNDPVL PKTSIPAAAK PDSTVTAAKH
ATPAARGQGA DMSGMIGFAK EANTTGGNNG EVVTVNTVAD LKKYMEDDKA RTVKLGANLS
ADSKVSINFG ANKTLLGTDK GNTLHNIYLA SGKTASNDIF QNLNFNHDAR YRENGDMQMF
ISSGQKYWID HITATGTKDQ NPKGLDKLLY VGGKADNVSL TNSKFQNNEY GVILGQPDDS
AAAKAEYKGY PRMTIANNVF SNLDVRGPGL FRQGQFDVVN NSIDKFHLGF TATGNATILS
QANYFSNGVD VSNKASNSGV LDDYGDAHFK DIGSNVSFTQ KSPVTAWTPS YNRDVKTAEA
ARAYDLANAG AQVVK