Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_0121 |
Symbol | |
ID | 5053211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | + |
Start bp | 113982 |
End bp | 116180 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640470272 |
Product | TPR repeat-containing protein |
Protein accession | YP_001154906 |
Protein GI | 145588309 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.261868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCCC CTCAAACTAT TCAGCTTCTT CAAAAAGCCG TGCAAGAAAT TCAGGCTAAT GAGCTGGAAT CAGCAAAATT ACATCTCAAA GCCGTATTGA GTAAAGAAAG AAGTCAGCCA GATGCGCTGC GTTTTCTGGG GATAATTGCC GGCCTTCAAA AAGATTGGAC TCAAGCGCTG GAATTAATTG ATCAGGCAAT TGCAGTTAGT CCAGAAAATG GATTGGCATA CAGCAATCGA GGCAACGTTT TAAGCGAACT AAAGCGCCAT GAAGAGGCGC TCACTTGTTA TGAAAAGGCG ATTTCTCTAC AGCCAGATTA TGCCGAGGCC TATAGTAATC AAGGGAATGC TTTGCAAGCT TTGGGGCGCT TTGAAGAGGC GCTCATTTGT TATGAAAAGG CGATTTCTCT ACAGCCAGAT TATGCCGAGG CCTATAGTAA TCAAGGGAAT GCATTTTTAG CATTGTCGCG TTATCAGGCG GCCTTAGCTT GTTATGAAAA GACCGTATTG TTAGACGCCA ATAATGCCAA GGCATTTTAT GGTGCAGCCA TTATTTTTGA TCTTCAAAAA AAGTATGACT ATTCTTTAGG TTGTTGCAAT CAGGCATTAA AAATACAGCC TCACTATCTT GAGGCTTTGG CACTCAGAGG CACCATTTAT TTCCGGGCCA AGGACTACGA ACAGGCGCTT ACCAATTTTG ATGAAATTCT TCTCATTCAC CCCCAATCCA TTCAGGCTTG GTTGTCAAAG GGTGATATTT ACGCAGAAAC GAAGCAGTTT GAAAAAGCTG ATGAAGCTTT TAAGAAGGCT TTAGTAATAG ACCCTACAAA AGATTTTCTG TTTGGGCTTT CTATACAAAA TCAGATACAA ATGTGTCAAT GGACTGACCT GACGAATCAA GTGAAGGATT TGGTTCATCG AGTTAGAGAT GGTGAAAAAG TCGCAATTCC ATACAATCTC ATCTCTTTAG TAGATGATAA GAGTTTAATT AGGTGCTCCA TTGAGATTTA CGCACAGGCC ATGCGGGGTA ATATTGAGAG AGTTAAGCAA GAGAAAATTT CTCCAAAGAA AAAAATACGG CTTGGTTATT TTTCTGCAGA TTTTCATAAT CACCCTACGG CATATTTAGT TGCCGAACTT TTTGAGTGCC ATGACAAAGA AAAATTTGAA CTTTTTGGAT TTGTATTTGG CCGTAATGCT CCAGATGAGA TGCGTAACAG GCTGGAGAAA TCATTTGACC AATTTCTGGA TGTTGAAGAT AGATCGGATG AAGAGATTGC TCAGCTAGCG CGTGAAATGC ACATCGATAT AGCCATTGAT TTAAAAGGCT TTACCAAAGA AGCGCGCCCT AAAATATTCA TGTATGGCGC TGCCCCGATT CAAATTAGCT ATCTAGCATT TCCTGGGACG ATGGGATTGC CGTGTTTTGA TTATGTGATC GCCGATCCTA TTTTGATTCC GGAAAAGCAT CAAGATGGCA TGGTAGAGAA AATTATTTAT ATGCCCGATA GCTATCAAGT CAATGATAGA AGCAGGAAGA TATCACCACT GATCAAGTCT CGTAAAGAGC TTGGGCTTCC TGAGTCGGGA TTTGTGTTTT GCTGTTTTAA TAACAACTAT AAGATTACTC CAGCAGTTTT AGATGGCTGG GTAAAAATTT TGCTGGCAGT AGAGGGTAGT GTACTTTGGC TTTATGAAGA CAACCCTATT GCGGTCGCTA ACTTAAAGCA AGAGGCTTTA ACAAGAGGCT TAGATGCTGG CAGATTTATT TTCGCTGGGC GCATGGATTC AGCAGATCAT TTAGCTAGAT ATAAGAATGC TAATTTATTC CTAGACACTA CCCCGTGTAA TGCACACACA ACAGCTAGTG ATGCTTTATG GGCTGGGTTA CCGGTACTCA CGCTAGCTGG AGAGTCTTTT GGTGCTCGCG TTGCCGCAAG TCTTAATAAT GCCGTTGGAC TTTCTGGTTT AACGGTAGAA ACACAAGAAG AATATGAAGC GCTAGCAATT CAGTTAGCAA CTAGTCCAAG CAGATTGAAG GAACTTAAAG ATCGACTTGA GAGAAATCTG TTAACTGCCC CTCTATTTGA TACGCCGCTA TTTACAAAAA ATCTAGAGGC CGGATACATA GAGGCTTATG AGCGACATCA ACAAAATATG CCGTTAGATC ATATCTATAT TGACGCCCCT AAAGGCTAA
|
Protein sequence | MSSPQTIQLL QKAVQEIQAN ELESAKLHLK AVLSKERSQP DALRFLGIIA GLQKDWTQAL ELIDQAIAVS PENGLAYSNR GNVLSELKRH EEALTCYEKA ISLQPDYAEA YSNQGNALQA LGRFEEALIC YEKAISLQPD YAEAYSNQGN AFLALSRYQA ALACYEKTVL LDANNAKAFY GAAIIFDLQK KYDYSLGCCN QALKIQPHYL EALALRGTIY FRAKDYEQAL TNFDEILLIH PQSIQAWLSK GDIYAETKQF EKADEAFKKA LVIDPTKDFL FGLSIQNQIQ MCQWTDLTNQ VKDLVHRVRD GEKVAIPYNL ISLVDDKSLI RCSIEIYAQA MRGNIERVKQ EKISPKKKIR LGYFSADFHN HPTAYLVAEL FECHDKEKFE LFGFVFGRNA PDEMRNRLEK SFDQFLDVED RSDEEIAQLA REMHIDIAID LKGFTKEARP KIFMYGAAPI QISYLAFPGT MGLPCFDYVI ADPILIPEKH QDGMVEKIIY MPDSYQVNDR SRKISPLIKS RKELGLPESG FVFCCFNNNY KITPAVLDGW VKILLAVEGS VLWLYEDNPI AVANLKQEAL TRGLDAGRFI FAGRMDSADH LARYKNANLF LDTTPCNAHT TASDALWAGL PVLTLAGESF GARVAASLNN AVGLSGLTVE TQEEYEALAI QLATSPSRLK ELKDRLERNL LTAPLFDTPL FTKNLEAGYI EAYERHQQNM PLDHIYIDAP KG
|
| |