Gene PC1_3223 details


Gene Information

Locus tag: PC1_3223
Symbol:
ID: 8134195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pectobacterium carotovorum subsp. carotovorum PC1
Kingdom: Bacteria
Replicon accession: NC_012917
Strand:
Start bp: 3631513
End bp: 3632853
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 52%
IMG OID: 644866523
Product: glycosyltransferase, MGT family
Protein accession: YP_003018782
Protein GI: 253689592
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
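
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3631513
end_bp = 3632853

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1341
print(protein_length)  # 446
```

The calculation does not depend on the strand, which the record leaves blank.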


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACATA TTCTTATGGC GGCGATGGCA ACGCCAGGGC ATGTTTATCC TTTGCTGACT 
ATCGCCCGCT ATTTGGTTGA GCAAGGGAAT GACGTCACGC TGTTTAGCGG GGCGTTGTTC
CGCGAGCAGG CGGAGGCGGC TGGCGTCGGG TTTATCCCTT TTAGTGATGA CATTGATTTT
GACTACCGAC ACCTTGAACA GCATTTCCCC CAGCGTGCCA TGCTGCCGCC GGGTAATGCA
CAGATGGCTT TGGCGCTGAA AGATTTTTTT GCAGCACCGA TTCCCTTGTT GGATCGCCAG
TTGCGTGACG CGTTGGCAAA AACCGATGCC GATCTGCTCA TGGTTGAAAA CTGTTTTTAT
GGTGTCCTGC CGTTGTTGCT TTCTGGGCAG CGTCCGCCCG TCATCGTCAT CGGGGTTACG
CCGCTGTCCT ATTCGACGCG AGATTCAGTT TTCTATGGGC CTCGTATTCC TCCGAAATTG
CTGCCGCCCG ATCTGAGCCA TGAACAGCTT GTTGATGAGG AAACACAGGC ATTGATGAGT
GAAGTTCAGC GCGCGTTTGA TCATGCGATG ATGCAGTCGG GAGGGAAGCC CCTCGAACAG
CCTTTTATGG ATGCGCTGAT TGGCCGCTGC GATCGTTTCT TGCAGCTATC CACTACCGAG
CTAGAGTATC AGCGTGATGA TTTGCCACAA AGTGTGCGAT TTATCGGGCC ACTATCTCGC
CAGATTGCAG CAGAACGTGT TCCTGAATGG TGGACGTCGG ATGACAGTCG GCCGCTTATT
ATCGTTTCAC AAGGAACACT GGCGAATGTC GATCTCCAGC AACTGATTGG GCCGACACTG
CGTGCGCTGG CTGATTTACC TGTCAGGGTG CTGGCGACCA CAGGAGGGCG TGCGGTTGAT
TCACTGCAAC CCTCTTTGCC TGAAAATGCC AGAGTGGTGC GTTTCCTCTC TTATGATGAT
TGGTTGCCCA GGGCGTCGAT CTTCATCACC AACGGTGGAT ACGGTTCGAT AAATGCTGCA
TTGAAGGATG GTGTGCCTTT AATTGTGGCT GGGGTAGGAG AAGATAAGCA GGAAAGTGCT
GCACGGGTGG TATTCGCCAA ATGTGGAATT AACTTGCAAA CCAGTACGCC AAGTGAACAG
CAGATTAAGC AGTCTGTTAT TGAGATACTG GAGGATCCGG GCTATTTGCA GCGCGCAAGA
TGGATTAAGG CAGATTATGC CAGCCACGAT GCGTTGGCTC TGATTCAGGC CGAAGTTCAT
GCACTCTCTT TTTCAAAGGA ATACCTGTCA AAGGAGGACG GTATAGCATC AGCCAGGGAT
GCTTCTCTGC TCCGTGTTTA G
 
Protein sequence
MAHILMAAMA TPGHVYPLLT IARYLVEQGN DVTLFSGALF REQAEAAGVG FIPFSDDIDF 
DYRHLEQHFP QRAMLPPGNA QMALALKDFF AAPIPLLDRQ LRDALAKTDA DLLMVENCFY
GVLPLLLSGQ RPPVIVIGVT PLSYSTRDSV FYGPRIPPKL LPPDLSHEQL VDEETQALMS
EVQRAFDHAM MQSGGKPLEQ PFMDALIGRC DRFLQLSTTE LEYQRDDLPQ SVRFIGPLSR
QIAAERVPEW WTSDDSRPLI IVSQGTLANV DLQQLIGPTL RALADLPVRV LATTGGRAVD
SLQPSLPENA RVVRFLSYDD WLPRASIFIT NGGYGSINAA LKDGVPLIVA GVGEDKQESA
ARVVFAKCGI NLQTSTPSEQ QIKQSVIEIL EDPGYLQRAR WIKADYASHD ALALIQAEVH
ALSFSKEYLS KEDGIASARD ASLLRV
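
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is given in coding orientation, uses the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code for elongation), and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence above.
print(translate("ATGGCACATA TTCTTATGGC GGCGATGGCA"))  # MAHILMAAMA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the protein sequence and the GC content field to be compared against the record.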