Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4958 |
Symbol | |
ID | 6412650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5338389 |
End bp | 5340086 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714841 |
Product | 5'-Nucleotidase domain protein |
Protein accession | YP_001993922 |
Protein GI | 192293317 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.416285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTCGA GGCGGGAATT CCTGCAGGCG ACGGCCGCCG CATCGGCGCT GACGATCGCC GGCGGCCTGG GGCCGATCGG GCGGGTCGCG GCCCAGCAGC GGCTGACCCA GGGCGACATC CTGAAATTCG ATCCGCTCGG CACGGTGACG CTGCTGCATA TCACCGACAC GCACGCGCAA CTCGTGCCGC TGCATTTCCG CGAGCCCTCG GTCAATCTCG GTGTCGGCGA GGTCAAGGGC AAGCCGCCGC ATCTCACGGA CGAGGAATTC CGCAAGTACT TCCATATCGC CACCGGCTCG CCGGATGCGT TCGCGCTGAC CGCGGACGAC TTCACCGCGC TTGCCCGCAA CTACGGCAAG ATGGGCGGCT TCGACCGGAT CGCCACGCTG GTCAAGGCCA TCCGCGCCGA GCGCGGCGCC GACAAGGTGC TGCTGCTCGA CGGCGGCGAC GCGCTGCAGG GCAGCTGGAG CTCGCTGAAG AGCAACGGTC AGGACATGAT CGACGCGCTC GCCGGGCTCA AAGTCGACGC GATGACCGGC CATTGGGAGT TCACCTACGG CGCCGACCGC GTCAAGGAAA TCGCCGAGAA GGCGCCGTTC GCGTTCCTGG CGCAGAACGT CCGCGATATC GAATGGCAGG AGCCGGTGTT CGAGGCCCGC AAGATGTTCG AGCGCGGCGG CGTCAAGATC GCGGTGATCG GCCAGGCGTT GCCGCGCACC GCGGTCGCCA ATCCGCGCTG GATGTTTCCG AACTGGGAGT TCGGCATCCG CGAGGAGGAC ATGCAGAAAC AGGTCGACGA TGCGCGCGCC GAGGGCGCCG CGATCGTGGT GCTGCTGTCG CACAACGGCT TCGACGTCGA TCGCAAGCTC GCCGGCCGGG TGAAGGGCCT CGACGTCATC CTCACCGGCC ACACCCACGA CGCGATGCCG GGCGTGATCA AGGTCGGCGA AACCGTGCTG GTGGCGTCGG GCTCGCACGG CAAGTTCGTG TCGCGGCTCG ACATCAAGGT CGACGGCGGC AAGGTCGCGG ACATCCGCTT CAAGCTGATG CCGGTGTTTG CGGATGCGAT CACGCCAGAC CCGGAGATGG CCAAGCTGGT CGAGAAGCTG CGCGAGCCTT ACGCCAAGGA TCTCGCCCGC GTCGTCGGCA AGACCGACTC GCTCTTGTAT CGCCGCGGCA ATTTCAACGG CACCTTCGAT GATTTGATCT GCGACGCGAT GCTGAAGCAG CGCGACACCG AAATCGCGCT GTCGCCGGGC TTCCGCTGGG GCGGCACACT GCTGCCGGAA GAGGGCATCA CCTGGGAGGC GATCACCAAC GCCACCGCGA TCACCTATCC GAACTGCTAC CGCACGGAGA TGACCGGCGA GCAGCTCAAG AACGTGCTCG AAGACATCGC CGACAACATC TTCCATCCCG ACCCTTACTA TCAGGGCGGC GGCGACATGG TGCGCACTGG CGGCATGGGC TACGCGATCG ACATCTCCAA GGAGATGGGC TCGCGCATCT CCAACATGAC GCATCTGGCA ACCGGCAAGC CGATCGAGGC GTCGAAGAAG TACACGGTGT CCGGCTGGGC CAGCGTCAAT CAGGGCACCG AAGGTCCGCC GATCTGGGAG GTGCTGGAGA AGCACGTCGC CAGCGCCGGC CCGGTGAAGA TCGAACCGAA CAGCGCGGTC AAAGTCTCCG GTGCCTGA
|
Protein sequence | MISRREFLQA TAAASALTIA GGLGPIGRVA AQQRLTQGDI LKFDPLGTVT LLHITDTHAQ LVPLHFREPS VNLGVGEVKG KPPHLTDEEF RKYFHIATGS PDAFALTADD FTALARNYGK MGGFDRIATL VKAIRAERGA DKVLLLDGGD ALQGSWSSLK SNGQDMIDAL AGLKVDAMTG HWEFTYGADR VKEIAEKAPF AFLAQNVRDI EWQEPVFEAR KMFERGGVKI AVIGQALPRT AVANPRWMFP NWEFGIREED MQKQVDDARA EGAAIVVLLS HNGFDVDRKL AGRVKGLDVI LTGHTHDAMP GVIKVGETVL VASGSHGKFV SRLDIKVDGG KVADIRFKLM PVFADAITPD PEMAKLVEKL REPYAKDLAR VVGKTDSLLY RRGNFNGTFD DLICDAMLKQ RDTEIALSPG FRWGGTLLPE EGITWEAITN ATAITYPNCY RTEMTGEQLK NVLEDIADNI FHPDPYYQGG GDMVRTGGMG YAIDISKEMG SRISNMTHLA TGKPIEASKK YTVSGWASVN QGTEGPPIWE VLEKHVASAG PVKIEPNSAV KVSGA
|
| |