Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0590 |
Symbol | |
ID | 4021058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 662255 |
End bp | 663403 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637960777 |
Product | hypothetical protein |
Protein accession | YP_567729 |
Protein GI | 91975070 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGG ATACGCTGAC CACCCACGCC GGCATCGCCG TTCACGGCGC CGAGCGACTG CCGTCCGTCG ACGTCGATAG CTACAATATC GAGCTCAAGG ACGATGACGG ATTTCTCGGC GACCGCGCCA GCAAGGGCGC ATTCCAGCGC ATCCTCGACG CCTCCCGCAA GCCGCTGCGC AAGAACGGCG AAGATCCGCT GGGCAAGAAG GCGACCGAGG AAATCTCCAA GGGGACGCTC GATGATATCC TGAAAGGCGA CGACGTCGGT GCGGCTGCGC TGGTGCACGG CGCGATCGAG GAATTCGCCC AGGAATTGGC CTTCGTCACC CGCCGCTTCC TCAAGACCAA GGCCTGGGCC GACACCGAAT GCATCGTTGT CGGCGGCGGC TTCCGCCAGA GCCGGGTCGG CGAGCTGGCG ATCGCGCGCA CCGACATCCT GCTCAAATCC GAGGGGCTGA AGGTCGATCT GGTGCCGATC CGGTTCGATC CGGACGAAGC CGGGCTGATC GGCTGCCTGC ATCTGGCGCC GTCGTGGATT TTCGAAGGCC ACGACAGCAT TCTGGCGGTC GATATCGGCG GCTCGAACAT CCGCTGCGGC GTGGTCGAGA CCTTCTGGAA GAAGGCGCCG GATCTGTCGA AGGCCTCGGT CTGGAAAGTC GAGCTGTGGC GGCACGCCGA GGACGAGCCG ACCCGCGAAG GCGCGGTGAA GCGGCTGACC AAGATGCTCA AGGACCTGAT CGCAGACGCC GAGAAGGAGG GCTTCAAGCT CGCGCCGTTC ATCGGGATTT CCTGCCCGGG CGTGATCAAT GCCGACGGCT CGATCGAGAA GGGCGCCCAG AACCTGCCGG GCAATTGGGA AAGCAGCAAA TTCCACCTGC CGCGCAGCCT GATCGAGGGC ATCCCGATGA TCGGCGACCA CGACACCGCG ATCCTGATGC ACAATGACGG CGTCGCCCAG GGGCTGAGCG AAGTGCCGTT CATGCAGGAG TTCGACCGCT GGGGCGTGCT GACGATCGGC ACCGGCCTCG GCAACGCCCG CTTCACCAAC CGGAAAACCA AGGAGAAGGC CAAAAAGGAC AAGGCCAAGG ACGACAAGCC CAAGGAGAAG GAAAAAGAGA AGGACAAGGA GCGCAAGGAG AAGGCGTAA
|
Protein sequence | MAEDTLTTHA GIAVHGAERL PSVDVDSYNI ELKDDDGFLG DRASKGAFQR ILDASRKPLR KNGEDPLGKK ATEEISKGTL DDILKGDDVG AAALVHGAIE EFAQELAFVT RRFLKTKAWA DTECIVVGGG FRQSRVGELA IARTDILLKS EGLKVDLVPI RFDPDEAGLI GCLHLAPSWI FEGHDSILAV DIGGSNIRCG VVETFWKKAP DLSKASVWKV ELWRHAEDEP TREGAVKRLT KMLKDLIADA EKEGFKLAPF IGISCPGVIN ADGSIEKGAQ NLPGNWESSK FHLPRSLIEG IPMIGDHDTA ILMHNDGVAQ GLSEVPFMQE FDRWGVLTIG TGLGNARFTN RKTKEKAKKD KAKDDKPKEK EKEKDKERKE KA
|
| |