Gene Information |
Locus tag | EcSMS35_2863 |
Symbol | kpdC |
ID | 6145321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2937063 |
End bp | 2938490 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617732 |
Product | 4-hydroxybenzoate decarboxylase, subunit C |
Protein accession | YP_001744887 |
Protein GI | 170681645 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.967142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCATTTG ATGATTTACG CAGCTTTTTA CAGGCGCTCG ACGACCATGG GCAATTGCTG AAAATCAGCG AAGAGGTGAA CGCAGAACCG GATCTGGCAG CCGCAGCCAA CGCCACCGGG CGTATCGGCG ACGGCGCGCC CGCGCTGTGG TTTGACAATA TTCGCGGCTT TACCGATGCC CGCGTGGCGA TGAACACCAT CGGTTCCTGG CAGAACCACG CGATTTCCCT CGGCCTGCCG CCAAATACCC CGGTTAAAAA ACAGATCGAT GAGTTTATCC GCCGCTGGGA CAACTTCCCG ATAGCCCCGG AACGCCGAGC CAATCCAGCC TGGGCGCAGA ACACCGTCGA TGGCGAAAAG ATTAACCTGT TCGATATCCT GCCGCTGTTT CGTTTAAACG ACGGTGATGG CGGTTTCTAT CTCGACAAAG CGTGCGTGGT TTCCCGCGAT CCGCTCGACC CGGATAACTT CGGCAAGCAG AACGTCGGTA TCTACCGCAT GGAAGTGAAG GGCAAACGTA AGCTCGGCCT GCAACCGGTG CCGATGCACG ATATCGCGCT GCATCTGCAT AAAGCGGAAG AGCGCGGTGA AGATCTGCCG ATTGCCATCA CCCTCGGTAA CGATCCGATC ATCACCCTGA TGGGGGCCAC GCCGCTGAAA TATGATCAGT CCGAGTACGA AATGGCGGGC GCGCTGCGCG AAAGTCCGTA CCCGATTGCC ACCGCCCCAC TGACCGGTTT TGATGTACCC TGGGGCTCGG AAGTGATCCT CGAAGGGGTT ATCGAAAGTC GTAAACGTGA AATCGAAGGG CCGTTCGGTG AGTTTACCGG GCACTACTCC GGCGGGCGCA ACATGACCGT GGTGCGCATC GATAACGTCT CTTATCGCAG CAAACCCATT TTTGAATCGC TCTATCTCGG GATGCCGTGG ACGGAAATCG ACTACCTGAT GGGGCCAGCC ACCTGCGTGC CGCTGTATCA GCAACTGAAA GCCGAGTTCC CGGAAGTGCA GGCGGTAAAC GCCATGTACA CCCACGGCCT GCTGGCGATT ATCTCCACCA AAAAACGCTA CGGTGGCTTT GCCCGCGCGG TGGGCCTCCG TGCGATGACC ACGCCACACG GTCTGGGCTA TGTGAAGATG GTGATTATGG TCGATGAAGA CGTTGATCCG TTCAACCTGC CGCAGGTGAT GTGGGCGCTC TCTTCGAAAG TGAACCCGGC AGGGGATTTG GTGCAGTTGC CGAATATGTC CGTGCTGGAA CTCGACCCTG GCTCAAGTCC GGCGGGGATC ACCGACAAGC TGATTATCGA CGCCACCACG CCTGTCGCCC CGGACAACCG TGGTCACTAC AGTCAGCCGG TGGTGGATTT ACCGGAAACC AAAGCCTGGG CTGAAAAACT GACCGCTATG CTGGCCGCAC GTCAATAA
Protein sequence | MAFDDLRSFL QALDDHGQLL KISEEVNAEP DLAAAANATG RIGDGAPALW FDNIRGFTDA RVAMNTIGSW QNHAISLGLP PNTPVKKQID EFIRRWDNFP IAPERRANPA WAQNTVDGEK INLFDILPLF RLNDGDGGFY LDKACVVSRD PLDPDNFGKQ NVGIYRMEVK GKRKLGLQPV PMHDIALHLH KAEERGEDLP IAITLGNDPI ITLMGATPLK YDQSEYEMAG ALRESPYPIA TAPLTGFDVP WGSEVILEGV IESRKREIEG PFGEFTGHYS GGRNMTVVRI DNVSYRSKPI FESLYLGMPW TEIDYLMGPA TCVPLYQQLK AEFPEVQAVN AMYTHGLLAI ISTKKRYGGF ARAVGLRAMT TPHGLGYVKM VIMVDEDVDP FNLPQVMWAL SSKVNPAGDL VQLPNMSVLE LDPGSSPAGI TDKLIIDATT PVAPDNRGHY SQPVVDLPET KAWAEKLTAM LAARQ
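
The coordinate and length fields in this record are arithmetically related, and a short script can cross-check them. The sketch below is illustrative only and not part of the source database: the helper names (`gene_length`, `expected_protein_length`, `clean`, `gc_percent`) are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The GC helper counts G and C over the whole sequence string; how the record's own "GC content" value was rounded is not stated here, so no particular result is claimed for it.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


# Values copied from the "Start bp", "End bp" fields above.
length = gene_length(2937063, 2938490)
print(length)                           # compare with "Gene Length"
print(expected_protein_length(length))  # compare with "Protein Length"
```

To check the GC field, pass the full "Gene sequence" string to `gc_percent`; `clean` strips the block spacing first, so the sequence can be pasted as it appears in the record.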