Gene Pnuc_1343 details


Gene Information

Locus tag: Pnuc_1343
Symbol:
ID: 5052750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 1419212
End bp: 1420321
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 45%
IMG OID: 640471515
Product: transglutaminase domain-containing protein
Protein accession: YP_001156121
Protein GI: 145589524
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
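
The coordinate and length fields above are consistent with one another. The following minimal sketch shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1419212
end_bp = 1420321


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Amino acid count, assuming the last codon is a stop codon."""
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1110 369"
```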


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACTA CCCGCCGCTC AGCCCTAAAA ACAATTGCCG GAGCTTTGGC TATTCCTGCG 
CTATCGCCTA TCTCTTCTGT ATTTGCGCAA GCACCTGTCG GATGGACTAC TTACGAAATC
GTAACCGAAG TGAATTTAGA GTCGCCAAAT GGGGCTGCAG AATCTTGGAT TCCATTACCC
CTGGTATTGG ACACTAATTA CTTCCAAACA TTAGCCATCA GATCTGAAGC AAGCGATCCT
AAAGCAGTAA ACCAAATTTA CGAGACGCCA GATAAACAAG CGCGCATGCT TTGGACTAAA
TGGGATAAAT CTGCAACAAA CCATAGTGTC AAAGTTTCTA TCTTGGTAAG TACATTCAAC
CGCCATTTAG AAATGGCCCC CCCTAGCCCT GCACTAAAGC TCTCAAGAGA AGATCAGCGT
TTTTGGACTC GATCTACAAA ATACCTTCCC ACTGATGGCA TTGTGAAAAC TAAATCACAG
GAAGCGCTTG CTAATACCCC TGCAAATGCT ACTGATGTAG AAAAAGCAAA AGCCATCTAC
AACTGGGTTG TAGATAACAC TCATCGTGAT CCTAAAACTC GCGGTTGCGG TCAAGGTGAT
GTGAAGTTGA TGCTGGAAAC CAATAATCTG GGAGGCAAGT GTGCCGATAT CAATGCTGTC
TTTGTCGCCT TAGCGCGTTC AGCTGGTATT CCAGCCCGTG ATGTCTACGG TATTCGTATT
GCCGATTCTG CACGTGGCTA TAAGAGCCTT GGTAAATCAG GCGATATCAC CAAAGCACAG
CACTGCCGCG CAGAGTTCTA TGCAAATGGT TATGGTTGGG TTCCAGTCGA TCCAGCAGAC
GTTCGTAAAG TGATCTTAGA AGAAACAGGT GGCTTGGCAG TGAACGACCC TAAAGTATTG
GCGATTCGTG AGTACTTATT TGGCAACTGG GAAATGAATT GGATGGCATA CAACTATGAT
CACGATATTG CATTGCCAGG ATCCAAGCTA GGCAGCAAAG GTGATATTCC TTTCTTGATG
TACCCACAAG CAGAAAATAC TGAAGGTCGC TTTGATTCAC TAGACCCAGA TAATTTCAAA
TACAAAATTA CTAGCCGTCG GATCGGTTAA
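
A GC content figure like the one listed under Gene Information can be recomputed from the gene sequence. The sketch below assumes GC content is the percentage of G and C bases over all bases; how the source database derived and rounded its value is not stated. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna):
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: pass the sequence text with its 10-base grouping left intact,
# for example the first line of the gene sequence above.
first_line = "ATGACTACTA CCCGCCGCTC AGCCCTAAAA ACAATTGCCG GAGCTTTGGC TATTCCTGCG"
print(round(gc_percent(first_line), 1))
```

Running it over all lines of the gene sequence, rather than a single line, gives the value to compare with the listed percentage.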
 
Protein sequence
MTTTRRSALK TIAGALAIPA LSPISSVFAQ APVGWTTYEI VTEVNLESPN GAAESWIPLP 
LVLDTNYFQT LAIRSEASDP KAVNQIYETP DKQARMLWTK WDKSATNHSV KVSILVSTFN
RHLEMAPPSP ALKLSREDQR FWTRSTKYLP TDGIVKTKSQ EALANTPANA TDVEKAKAIY
NWVVDNTHRD PKTRGCGQGD VKLMLETNNL GGKCADINAV FVALARSAGI PARDVYGIRI
ADSARGYKSL GKSGDITKAQ HCRAEFYANG YGWVPVDPAD VRKVILEETG GLAVNDPKVL
AIREYLFGNW EMNWMAYNYD HDIALPGSKL GSKGDIPFLM YPQAENTEGR FDSLDPDNFK
YKITSRRIG
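
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method: it builds the codon-to-amino-acid mapping used by translation table 11 (the same mapping as the standard code) and ignores the alternative start codons of that table, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence above.
print(translate("ATGACTACTA CCCGCCGCTC AGCCCTAAAA ACAATTGCCG GAGCTTTGGC TATTCCTGCG"))
# prints "MTTTRRSALKTIAGALAIPA"
print(translate("TACAAAATTA CTAGCCGTCG GATCGGTTAA"))
# prints "YKITSRRIG*"
```

Both outputs agree with the start and end of the protein sequence; the trailing `*` is the TAA stop codon, which the protein listing omits.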