Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_2961 |
Symbol | |
ID | 8133916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | + |
Start bp | 3340971 |
End bp | 3342188 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644866244 |
Product | Arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_003018520 |
Protein GI | 253689330 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00892323 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA AAAAACGCGT ACTGATGGCC GCCATGCTGG CAACTGGCCT GTTGACCTTC TCCCTGCCGC AAACCCTGTA TGCCGCAGAG AATGTGACGA TCAATAAGTT GACGAACGTC CCTGCCGACT TCATTAAAGG CGCGGATATT TCCATGCTGA ATGAGGTGGA AAAGCACGGC GGAAAATTTT ATGACGAGAA TGGCAAACAG AAAGACGCCA TGCTGATTCT GAAAGAGAAT GGGATTAACT ATATTCGCCT GCGCATCTGG AACGATCCGA AAGATGCAGC GGGCAACGCC TACGGCGGCG GTAACAACGA TCTGGCCACC ACGCTGGCGC TGGCTAAACG CGCCAAAGCG AACGGCATGA AAGTGCTGCT GGATTTCCAC TACAGCGATT TCTGGACCGA TCCGGCTCAC CAAAACAAGC CTAAAGCCTG GTCTGGCCTA GACATGGCAA AGCTCACCAC GGCCGTACAT GATTTCACTA AAGCCACGAT TAGCGAATTC CAGAAAGCGG GCATCATGCC GGATATGGTG CAAATCGGTA ACGAACTGAA CGGCGGCATG CTGTGGCCGG AAGGAAAAAG CTGGGGCCAG GGTGGCGGCG AGTTTGATCG CCTCGCGGCG CTGCTGAAAG CCGGTATTCA GGGTGTTAAG GACGTACAAG GTTCGAATAA CGTCAAAATC ATGCTGCATC TGGCAGAAGG CACCAAAAAC GACACCTTTA TCTGGTGGTT CGATGAAATC GTCAAACGCA ATGTTCCGTT CGATGTTATC GGTGCCTCGT TCTACACCTA CTGGAACGGC CCCATCAGCG CGTTGCAGTA CAACATGAAC GACGTGACCA AACGCTACAA TAAAGACATC ATCGTGGTCG AAGCCGCCTA TGCCTATACG CTGGAAAACT GCGATAACGC GGAAAACAGC TTCCAGCAAA AAGAGCTGGA TGCAGGCGGC TATCCGGCCT CCGTTCAGGG TCAGGCCAAC TACCTGCACG ATCTGATGCA AAGCATCATC AACGTCCCCA ATCAGCGCGG CAAAGGCATC TTCTATTGGG AGCCCATCTG GCTGCCTACC CCCGGCGCAA CCTGGGCCAC GAAAGCAGGT ATGAAATACA ACAATGACGA ATGGAAGGAA GGCAACGCAC GAGAAAATCA GGCGCTGTTC GACTGCAAAG GCAACGTCCT GCCTTCTATC AAAGCTTTTA AGCCGTAA
|
Protein sequence | MKMKKRVLMA AMLATGLLTF SLPQTLYAAE NVTINKLTNV PADFIKGADI SMLNEVEKHG GKFYDENGKQ KDAMLILKEN GINYIRLRIW NDPKDAAGNA YGGGNNDLAT TLALAKRAKA NGMKVLLDFH YSDFWTDPAH QNKPKAWSGL DMAKLTTAVH DFTKATISEF QKAGIMPDMV QIGNELNGGM LWPEGKSWGQ GGGEFDRLAA LLKAGIQGVK DVQGSNNVKI MLHLAEGTKN DTFIWWFDEI VKRNVPFDVI GASFYTYWNG PISALQYNMN DVTKRYNKDI IVVEAAYAYT LENCDNAENS FQQKELDAGG YPASVQGQAN YLHDLMQSII NVPNQRGKGI FYWEPIWLPT PGATWATKAG MKYNNDEWKE GNARENQALF DCKGNVLPSI KAFKP
|
| |