Gene PC1_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_3371 
Symbol 
ID8134351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp3801774 
End bp3803108 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content58% 
IMG OID644866672 
Productglycoside hydrolase family 28 
Protein accessionYP_003018923 
Protein GI253689733 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCACT CTATCCAGTC ATACTCCCCC GCCGCCGACG GCGTCACACC GGATACCGCC 
ATTTTTCAGC AGGCTATCGA CAGGATTGCG GCACAGGGCG GCGGCACCTT GACGGTAGAA
CCGGGGCGCT ATCTGTTAGG GGGCTTGCTG CTGCCTTCCA ATTTTTGCCT GCAACTGGAG
GCGGGGGCTG AGCTCATCGT CAGCGGCGAC TATGAGCAGT TTACGCAGGC TACCACCATC
AGCATGGCCG AGCTGTCACA TCGGGCGTTT CTTTATGCTT ACCAACAGCG CAATATCACG
ATCTGCGGTC AGGGTAAGAT CATGGGAAAT GCCGACGCCT ATTTCTCGGT GGAACCCGAC
GCTCAAGGCT ATCGCCTGCC TGCGCAACAT CGCCCACGCA TTGTGGTTTT TGAGGATTGC
GAACACATCC GCCTGTGTGA CTTTACGATT GAACACGCGC CAATGTGGAC CGTGCATTTG
GTCAGCTGTC GTCACGTCAT CGTCGAACGC CTGACGATTG ATAACGATCT GAGCATGGCG
AATACCGATG CGCTGGATCT CGATAGCTGC CAGCAGGTAC AAATCAGCAA CTGCTCGCTG
AGCGCCGCCG ACGATGCACT GTGCATCAAA ACCACCAATA AGCCGCCACA TCTGCAACGT
AAGGTGCAGC AGGTCGTTAT CAGCAATTGC CTGTTGCGCT CCAAGAGCTG TGCGCTGAAG
GTCGGCACCG AAACCTTTGC CGACATTGAA GATATCTCCG TCAGCAACTG TGCCATTTAC
GATACCAACC GCGCGATCGG CCTGATCTCC CGCGATGGTG GCACGTTCCG ACGTTTGCAG
TTCAGCAACA TCACATTCCA GTGTGTCGCC GCACATCCGT GCCACTGGGG CAAAGCCGAT
CCGATCTTTA TCTCCGTACG CTATCGCGAT CCCGCCATCG AACCGGGCCG GATCGAAGCG
GTGCAATTTT CGCAGATCGC GGGGATCAGC GAGGGGGCGA TTAACCTGCA CAGCACGCCC
GCAGGCTACA TTCGTGACAT CCATTTCCAT GCCGTGCACC TCGAACAGCG GCAGAGCGAC
TCGCCGGAAC AGGGCATGTA CGATGTGCGT CCGCCCTGCA ACCCGGAACG CCCTACGGGC
ATGGGGTTAG ACAATGCGTA TCGGGTCGAT CCCATTACCG GGCGCGCATT CGGCGTTGAG
CACTACCCAG GCGGCATGCC CGCATTATTT GCTCGTGGCG TCCTGAACCT GACCACCAGC
CACATGACGA TCCACCGTCC CGATCCGCTC CCTTCAGGCT GGCATCACGC CACGATCGTG
CAGTTGGAAG AATAA
 
Protein sequence
MKHSIQSYSP AADGVTPDTA IFQQAIDRIA AQGGGTLTVE PGRYLLGGLL LPSNFCLQLE 
AGAELIVSGD YEQFTQATTI SMAELSHRAF LYAYQQRNIT ICGQGKIMGN ADAYFSVEPD
AQGYRLPAQH RPRIVVFEDC EHIRLCDFTI EHAPMWTVHL VSCRHVIVER LTIDNDLSMA
NTDALDLDSC QQVQISNCSL SAADDALCIK TTNKPPHLQR KVQQVVISNC LLRSKSCALK
VGTETFADIE DISVSNCAIY DTNRAIGLIS RDGGTFRRLQ FSNITFQCVA AHPCHWGKAD
PIFISVRYRD PAIEPGRIEA VQFSQIAGIS EGAINLHSTP AGYIRDIHFH AVHLEQRQSD
SPEQGMYDVR PPCNPERPTG MGLDNAYRVD PITGRAFGVE HYPGGMPALF ARGVLNLTTS
HMTIHRPDPL PSGWHHATIV QLEE