Gene Information |
Locus tag | RPD_1936 |
Symbol | |
ID | 4022418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2173810 |
End bp | 2175420 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962129 |
Product | protein of unknown function DUF853, NPT hydrolase putative |
Protein accession | YP_569072 |
Protein GI | 91976413 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
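The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a terminal stop codon (the record does not state either convention explicitly). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2173810
END_BP = 2175420
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the reported Gene Length and Protein Length.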
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.495042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA GCGAAACCGG CCACGACGAT ATCGATGGCA AGATCTTCAT CGGCAAGGGC GAGCAGCCGG CATGGCTCAC GCTCGGCCTC GCCAATCGCC ACGGCCTCGT CACCGGCGCC ACCGGCACCG GCAAGACAGT GTCGCTGCAG GTGATGGCCG AAGGCTTTGC GCGCGCCGGC GTGCCGGTGT TCGCCGCCGA CATCAAGGGT GATCTCTCGG GCATCGCCGA AACCGGCGAG GCGAAAGATT TCATCCTCAA GCGCGCGAAG GAGATGGGTC TGGCTTTTCA GCCTGATCAG TTCAGCACAG TGTTCTGGGA CGTGTTCGGC GAGCAGGGCC ATCCGGTGCG GGCCACTGTC TCGGAGATGG GACCGCTGCT GCTGTCGCGG ATGCTCGATC TCAACGACGT GCAGGAAGGT GTGCTCAACG TCGCGTTCCG TGTCGCCGAC GACATGGGCC TGCCGCTGGT CGACATGAAG GATCTGCGCG CGATGCTCGA TGCGATCGCG CCGATCGCCG CGAAGGTTGC CGAGAACGGC GACGTCAACG CCGACATCAG ACAGGCGGCG CAGGCGCTCG GCAACGTCAC CAAGCAGACT GTCGGCACCA TTCAGCGTCA GCTGCTCGTG CTGGAGAATC AGGGCGGCGA GAGTTTCTTC GGCGAGCCCG CATTGCAGTT GAAGGACTTC ATCCGCACCG ACAATCAGGG CCGCGGCCTC GTCAACATCC TGGTCGCAGA CAAGCTGATG ACCAATCCCA GGTTGTACGC GACCTTCCTG CTGTGGATGT TGGCGGAGCT GTTCGAGGAG TTGCCCGAAG TCGGCGATCC GGACAAGCCG AAGCTGGTGT TCTTCTTCGA CGAGGCGCAT CTGCTGTTTA ACGACGCGCC GAAGCCGCTG ATGGATAAAA TTGAACAGGT CGTGCGACTG ATCCGCTCCA AGGGCGTCGG CGTCTACTTC GTCACGCAGA ACCCGATCGA CGTGCCGGAT CGCGTGCTGG CGCAACTCGG CAACCGGGTG CAGCACGCGT TGCGCGCATT CACCCCGCGC GACCAGAAGG CGGTCGCGGC GGCGGCGACC ACGTTCCGAC CCAATCCCAA GCTCGACACC ACCAAGGCGA TCACCGAACT CGGCAAAGGC GAGGCGCTGG TGTCGTTCCT CGAAGGCAAC GGCACGCCGG CGATGGTCGA GCGCGTGATG ATCCGACCGC CCGCGGCCCG TATCGGGCCG ATCACGCCGG AGGAGCGCAA GGCGATCATC GCCGCGAGCC CGGTGAGGGG AAAGTACGAC ACCGCAATCG ATTCCGACTC CGCCTATGAG AAGTTGCGCG ATCGCATTGA GAACAAGAAT GCGGGCGCCG AAGGCGCGCC GGCCGAAGGC GGCATTCTCG GCCAGCTCGG CAGCATCGTC TCGACCGTGT TCGGCACCAG CGCGCCGCGC GGCAAGCTCA CCACCGGGCA GGTGGTGGCG CGCAATGTCG CGCGCAGCGT CACCAACACG GTGATCGGCG GCATCGCCGC CGATCTCGGC AAGCGCGTCG GCGGTTCGCT CGGCGGATCG GTCGGGCGCT CGATCGTCCG CGGTACGCTC GGCAGTCTGC TGCGCCGCTG A
|
Protein sequence | MTASETGHDD IDGKIFIGKG EQPAWLTLGL ANRHGLVTGA TGTGKTVSLQ VMAEGFARAG VPVFAADIKG DLSGIAETGE AKDFILKRAK EMGLAFQPDQ FSTVFWDVFG EQGHPVRATV SEMGPLLLSR MLDLNDVQEG VLNVAFRVAD DMGLPLVDMK DLRAMLDAIA PIAAKVAENG DVNADIRQAA QALGNVTKQT VGTIQRQLLV LENQGGESFF GEPALQLKDF IRTDNQGRGL VNILVADKLM TNPRLYATFL LWMLAELFEE LPEVGDPDKP KLVFFFDEAH LLFNDAPKPL MDKIEQVVRL IRSKGVGVYF VTQNPIDVPD RVLAQLGNRV QHALRAFTPR DQKAVAAAAT TFRPNPKLDT TKAITELGKG EALVSFLEGN GTPAMVERVM IRPPAARIGP ITPEERKAII AASPVRGKYD TAIDSDSAYE KLRDRIENKN AGAEGAPAEG GILGQLGSIV STVFGTSAPR GKLTTGQVVA RNVARSVTNT VIGGIAADLG KRVGGSLGGS VGRSIVRGTL GSLLRR
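The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field might be processed, the snippet below strips the spacing, computes a GC fraction, and translates the first five codons of the gene using a deliberately partial codon table. The helper names and the partial table are assumptions made for illustration; a full analysis would use the complete translation table 11 named in the record.

```python
def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# Partial codon table: only the five codons that start this gene.
PARTIAL_CODON_TABLE = {"ATG": "M", "ACG": "T", "GCG": "A", "AGC": "S", "GAA": "E"}


def translate_prefix(seq: str, n_codons: int) -> str:
    """Translate the first n_codons codons using the partial table above."""
    s = clean(seq)
    codons = (s[i:i + 3] for i in range(0, 3 * n_codons, 3))
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons)


# First two ten-base blocks of the gene sequence in this record.
FIRST_BLOCKS = "ATGACGGCGA GCGAAACCGG"

if __name__ == "__main__":
    print(translate_prefix(FIRST_BLOCKS, 5))
    print(gc_fraction(FIRST_BLOCKS))
```

Passing the full gene sequence string to `gc_fraction` would give the value to compare against the GC content field; the translated prefix can be compared against the first residues of the protein sequence.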