Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_1806 |
Symbol | |
ID | 3677061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 1978658 |
End bp | 1980070 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637713368 |
Product | proline-rich protein |
Protein accession | YP_318419 |
Protein GI | 75675998 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.321143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.323939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCATC GATATCAGGA CCGGCCCTTC CCGGAGGACG ACGATTACGG GCGCGACGAC CAGCGCGCGA CTGCTCCCAC GGAGCCGGAT CCGCTTGCGG AACTGGCGCG CCTGATTGGT CAGGGCGATT CTTTCGGTGA GCCCAGGCGG GAAACGCCGA TGGCTCCCGT GGATGATCCC GTCGACCATT TCGAGCCGCT GCCGCCGATC GAGGAACCGA TCGAGGGGGA GCCACCCGCC GGACCGCCGT CCTGGATTCA AAGGCGGATG GCCCGGCCCG ATCCTTCGCC GGAGCAACTC GAATCCTCGG CGCATCCGGT GCTGCGGCGC GCGGCGGTTT ATCCAGACAG CCATCAGCCT GAGCCGGAAT CTTATCAGGC CGATCCCGGC CGTTACGACG ATGCCCGCTA CGGGAATTTG CCGGACGATC AGCCGGCCTA TGCGTATGGC GATCGCGTTC CGCACGATGC TTATGAAGGC CAGTATCCGC GCGAGACCTA TCCGGAAGCG CCCTTCGGCG ACGAGAACGG CTATGGCGAA TCTTACGGCT ACGGGGAGGC CGAAGATGAA CCGGCTCCCG GCCGTCGCGG AAAAATGGCG ACGGTGGTCG CGGTTCTGGC GCTCGCGGTG GTCGGCACCG GCGCGGCGTA TGCTTATCGC ACCTTTGTCG GGAACGCGCC TAACGGTGAG CCGCCGATCA TCAGGGCCGA TACCGAACCC AACAAGATCG TGCCTCAGCA GACGACTTCG GGTGATGCCG TCGGCAAGTT GCTTCAGGAC CGGATGTCCG TGGAAAACGG CACCGAGCAG GTGGTGTCTC GCGAAGAGCA GCCGGTCGAT GTCAAAAGCG CCGTGAGCGC CGGCCCGCGT GTGGTTTTTC CGCCACTGAA CCAGAACGAT AACCCGCCGT CCGCGGCTAG CGTGGCTCCG GATATCAAGC CTCCCGCGAC CGTTGCAAAC AGCGCGCTCG GGGGCGACGA GCCGCGCAAG GTCAGGACCC TCGCGGTCCG CGGTGATCAG GCCGATCTCG CCGCCGCGTC CGCCGCCAGC CGGCATGCCG CTGCGGCGGG GGCCGCGCCC GGAGCCAATG TGCCGCTCTC GCAGACCGCC GACACGCGAA CCCGGATGGC GTCGACCCAT CCGGCGCAGC AGTCGTCCGC CAGCGGGGGC TATCTGGTCC AGGTTTCCTC GCAAAGGAAT GAGGCGGACG CGCAGGCGTC GTTCCGGGTG CTACAGGGTA AATTCCCGTC GATGCTGGGA TCGCGTACGC CTGTGATCAA GCGCGCCGAC CTCGGCGCCA AGGGTGTCTA CTATCGCGCA ATGGTCGGGC CGTTCGCCTC GTCCGGCGAG GCATCGCGTT TCTGCGGAAG CCTCAAATCG GCCGGAGGAC AGTGCGTCGT CCAAAGGAAT TAA
|
Protein sequence | MAHRYQDRPF PEDDDYGRDD QRATAPTEPD PLAELARLIG QGDSFGEPRR ETPMAPVDDP VDHFEPLPPI EEPIEGEPPA GPPSWIQRRM ARPDPSPEQL ESSAHPVLRR AAVYPDSHQP EPESYQADPG RYDDARYGNL PDDQPAYAYG DRVPHDAYEG QYPRETYPEA PFGDENGYGE SYGYGEAEDE PAPGRRGKMA TVVAVLALAV VGTGAAYAYR TFVGNAPNGE PPIIRADTEP NKIVPQQTTS GDAVGKLLQD RMSVENGTEQ VVSREEQPVD VKSAVSAGPR VVFPPLNQND NPPSAASVAP DIKPPATVAN SALGGDEPRK VRTLAVRGDQ ADLAAASAAS RHAAAAGAAP GANVPLSQTA DTRTRMASTH PAQQSSASGG YLVQVSSQRN EADAQASFRV LQGKFPSMLG SRTPVIKRAD LGAKGVYYRA MVGPFASSGE ASRFCGSLKS AGGQCVVQRN
|
| |