Gene Pnuc_0121 details

Gene Information

Locus tag: Pnuc_0121
Symbol:
ID: 5053211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 113982
End bp: 116180
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 41%
IMG OID: 640470272
Product: TPR repeat-containing protein
Protein accession: YP_001154906
Protein GI: 145588309
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [R] General function prediction only
COG ID: [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
        [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
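
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is an uncounted stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(113982, 116180)
print(length_bp)                  # 2199
print(protein_length(length_bp))  # 732
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed above.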


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.261868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTCCC CTCAAACTAT TCAGCTTCTT CAAAAAGCCG TGCAAGAAAT TCAGGCTAAT 
GAGCTGGAAT CAGCAAAATT ACATCTCAAA GCCGTATTGA GTAAAGAAAG AAGTCAGCCA
GATGCGCTGC GTTTTCTGGG GATAATTGCC GGCCTTCAAA AAGATTGGAC TCAAGCGCTG
GAATTAATTG ATCAGGCAAT TGCAGTTAGT CCAGAAAATG GATTGGCATA CAGCAATCGA
GGCAACGTTT TAAGCGAACT AAAGCGCCAT GAAGAGGCGC TCACTTGTTA TGAAAAGGCG
ATTTCTCTAC AGCCAGATTA TGCCGAGGCC TATAGTAATC AAGGGAATGC TTTGCAAGCT
TTGGGGCGCT TTGAAGAGGC GCTCATTTGT TATGAAAAGG CGATTTCTCT ACAGCCAGAT
TATGCCGAGG CCTATAGTAA TCAAGGGAAT GCATTTTTAG CATTGTCGCG TTATCAGGCG
GCCTTAGCTT GTTATGAAAA GACCGTATTG TTAGACGCCA ATAATGCCAA GGCATTTTAT
GGTGCAGCCA TTATTTTTGA TCTTCAAAAA AAGTATGACT ATTCTTTAGG TTGTTGCAAT
CAGGCATTAA AAATACAGCC TCACTATCTT GAGGCTTTGG CACTCAGAGG CACCATTTAT
TTCCGGGCCA AGGACTACGA ACAGGCGCTT ACCAATTTTG ATGAAATTCT TCTCATTCAC
CCCCAATCCA TTCAGGCTTG GTTGTCAAAG GGTGATATTT ACGCAGAAAC GAAGCAGTTT
GAAAAAGCTG ATGAAGCTTT TAAGAAGGCT TTAGTAATAG ACCCTACAAA AGATTTTCTG
TTTGGGCTTT CTATACAAAA TCAGATACAA ATGTGTCAAT GGACTGACCT GACGAATCAA
GTGAAGGATT TGGTTCATCG AGTTAGAGAT GGTGAAAAAG TCGCAATTCC ATACAATCTC
ATCTCTTTAG TAGATGATAA GAGTTTAATT AGGTGCTCCA TTGAGATTTA CGCACAGGCC
ATGCGGGGTA ATATTGAGAG AGTTAAGCAA GAGAAAATTT CTCCAAAGAA AAAAATACGG
CTTGGTTATT TTTCTGCAGA TTTTCATAAT CACCCTACGG CATATTTAGT TGCCGAACTT
TTTGAGTGCC ATGACAAAGA AAAATTTGAA CTTTTTGGAT TTGTATTTGG CCGTAATGCT
CCAGATGAGA TGCGTAACAG GCTGGAGAAA TCATTTGACC AATTTCTGGA TGTTGAAGAT
AGATCGGATG AAGAGATTGC TCAGCTAGCG CGTGAAATGC ACATCGATAT AGCCATTGAT
TTAAAAGGCT TTACCAAAGA AGCGCGCCCT AAAATATTCA TGTATGGCGC TGCCCCGATT
CAAATTAGCT ATCTAGCATT TCCTGGGACG ATGGGATTGC CGTGTTTTGA TTATGTGATC
GCCGATCCTA TTTTGATTCC GGAAAAGCAT CAAGATGGCA TGGTAGAGAA AATTATTTAT
ATGCCCGATA GCTATCAAGT CAATGATAGA AGCAGGAAGA TATCACCACT GATCAAGTCT
CGTAAAGAGC TTGGGCTTCC TGAGTCGGGA TTTGTGTTTT GCTGTTTTAA TAACAACTAT
AAGATTACTC CAGCAGTTTT AGATGGCTGG GTAAAAATTT TGCTGGCAGT AGAGGGTAGT
GTACTTTGGC TTTATGAAGA CAACCCTATT GCGGTCGCTA ACTTAAAGCA AGAGGCTTTA
ACAAGAGGCT TAGATGCTGG CAGATTTATT TTCGCTGGGC GCATGGATTC AGCAGATCAT
TTAGCTAGAT ATAAGAATGC TAATTTATTC CTAGACACTA CCCCGTGTAA TGCACACACA
ACAGCTAGTG ATGCTTTATG GGCTGGGTTA CCGGTACTCA CGCTAGCTGG AGAGTCTTTT
GGTGCTCGCG TTGCCGCAAG TCTTAATAAT GCCGTTGGAC TTTCTGGTTT AACGGTAGAA
ACACAAGAAG AATATGAAGC GCTAGCAATT CAGTTAGCAA CTAGTCCAAG CAGATTGAAG
GAACTTAAAG ATCGACTTGA GAGAAATCTG TTAACTGCCC CTCTATTTGA TACGCCGCTA
TTTACAAAAA ATCTAGAGGC CGGATACATA GAGGCTTATG AGCGACATCA ACAAAATATG
CCGTTAGATC ATATCTATAT TGACGCCCCT AAAGGCTAA
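
The gene sequence is displayed in space-separated blocks of 10 bases, 60 bases per row, so the spacing has to be removed before any analysis. As a minimal sketch (the helper names are hypothetical), the following strips the layout whitespace and computes a GC percentage of the kind reported in the GC content field. How the database rounds that field is not stated in the record, so the code returns the unrounded value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGAGCTCCC CTCAAACTAT TCAGCTTCTT CAAAAAGCCG TGCAAGAAAT TCAGGCTAAT"
row = clean_sequence(first_row)
print(len(row))  # 60
```

Passing all rows of the gene sequence, joined into one string, through `clean_sequence` and then `gc_percent` gives a figure that can be compared with the GC content field.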
 
Protein sequence
MSSPQTIQLL QKAVQEIQAN ELESAKLHLK AVLSKERSQP DALRFLGIIA GLQKDWTQAL 
ELIDQAIAVS PENGLAYSNR GNVLSELKRH EEALTCYEKA ISLQPDYAEA YSNQGNALQA
LGRFEEALIC YEKAISLQPD YAEAYSNQGN AFLALSRYQA ALACYEKTVL LDANNAKAFY
GAAIIFDLQK KYDYSLGCCN QALKIQPHYL EALALRGTIY FRAKDYEQAL TNFDEILLIH
PQSIQAWLSK GDIYAETKQF EKADEAFKKA LVIDPTKDFL FGLSIQNQIQ MCQWTDLTNQ
VKDLVHRVRD GEKVAIPYNL ISLVDDKSLI RCSIEIYAQA MRGNIERVKQ EKISPKKKIR
LGYFSADFHN HPTAYLVAEL FECHDKEKFE LFGFVFGRNA PDEMRNRLEK SFDQFLDVED
RSDEEIAQLA REMHIDIAID LKGFTKEARP KIFMYGAAPI QISYLAFPGT MGLPCFDYVI
ADPILIPEKH QDGMVEKIIY MPDSYQVNDR SRKISPLIKS RKELGLPESG FVFCCFNNNY
KITPAVLDGW VKILLAVEGS VLWLYEDNPI AVANLKQEAL TRGLDAGRFI FAGRMDSADH
LARYKNANLF LDTTPCNAHT TASDALWAGL PVLTLAGESF GARVAASLNN AVGLSGLTVE
TQEEYEALAI QLATSPSRLK ELKDRLERNL LTAPLFDTPL FTKNLEAGYI EAYERHQQNM
PLDHIYIDAP KG
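
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the conventional TCAG ordering and assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. It does not handle alternative start codons or ambiguous bases; this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGCTCCC CTCAAACTAT TCAGCTTCTT"))  # MSSPQTIQLL
print(translate("AAAGGCTAA"))                         # KG
```

The two outputs match the first ten and last two residues of the protein sequence shown above.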