Gene Information |
Locus tag | Rsph17029_3452 |
Symbol | |
ID | 4898292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 523832 |
End bp | 525274 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640114049 |
Product | hypothetical protein |
Protein accession | YP_001045317 |
Protein GI | 126464204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
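The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length) and that the gene length counts one terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 523832
END_BP = 525274


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints "1443 480"
```

Both results agree with the Gene Length (1443 bp) and Protein Length (480 aa) fields.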
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0700524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.443587 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGG CTCCGCCCGA GAGCGTCGCC GGCGAACATT TCGATGTGAT TATTGTGGGC TCGGGCTTCG GCTCCTCGTT CTTCCTGCAT CGGCTGCTGC GCCAGCCGGG GCGGCGGGTC CTCGTCCTGG AATGGGGTCG GCATTCGACA CACGACTGGC AGCTGGAGGA AGGGCGGAAT TCGCCCGTCA CCGACAGCGA CACCTACGCC ACCAACTCCG AGAAGCCTTG GAACTTCACC ATCGGATTCG GCGGAGGCAC CAACTGCTGG TTCGCCCAGA CGCCGCGCCT GCATCCGGCG GATTTCCGGC TCGGGTCCGA TCACGGCGTG GCGCAGGACT GGCCCGTCTC CTACGACGAT CTCGAACCCT ACTGGTGCGA GGCCGAGGAG ATCATGGCGG TTTCGGGCGA TCCCGACATG GCGCAGGTCA TGCCGCGCTC GCGTCCCTTT CCGCAGCCGC CGCATGTGAT GCCCGATCCC GACCGGCTGA TGAAGGCGGC GCGGCCCGAC AGCCATTTCG TGATGCCGAC CGCCCGGGCC CGCATCGCCA CCGAGACGCG CGCGGCCTGC TGCGCCTCGC TGCGCTGCCA GATCTGCCCG GCGGATGCGA AATTCACCGC GAACAATTCG CTCGTGCCGC TCTACGAGAC GCCGGGCGTG ACCCTCTGCC TCGAGACGGA GGTGCGCCGG TTCGAGGCGG CGGGCTCGTC GATCTCGGCG GCGGTGATCC GCGGCCCCGA CGGGCGCGAG CATAAGGTGA CGGGCGATCT CTTCGTGCTC GGCGCCAACG CGATCCACAG CCCGGCGATC CTCCTGCGCT CGGATCTGGG CGGCGGGCTG ACCGGCGTGG GGCTGCACGA ATCCTACGGC TGGTCGATGG AGGCCTGGCT CGACGGGGTC GACAATTTCG GCGGCAGCAC CATCACGACG GGGCTCGACT TCGGCCTCTA CGACGGGCCG CACCGCAAGG ATCGGGGCGC GGCGCTGGTC TATTTCGAGA ATCGCTGGTC GCACGGGATG CGGCTCGGGG CCGAGCGGAT GCGCCAGACC CTGCCGCTCG TGATCGTGAC CGAGGACCTG CCCGAGGACC GCAACCGCGT GACGCTGGAT GGCGAGGGGC GGGCCTTCAT CGACTATCAC GGACCTTCGG ATTATGCGCT GCGCGGGATG GAGCGGGCCA AGGCCGCGCT GCCCGAGCTG CTCGCGCCGC TGCCGGTCGA GAAGATCCTC GACCACGGCA TCCGCGAGAC GGAAAGCCAC CTGCAGGGCA CGCTGCGGAT GGGATCCGAT CCGGCCACGT CCGTGGTGGA CGCGGGCCTT GTCCATCACC GGCTGCGCAA TCTGGTGGTG GTGGGCACCA GCACCTTCCC CACCTGCTCG GCCGCCAACC CTTCGCTCAC CGCCGCCGCG CTGTCGCTGC GCGCCGCCGA CCTTCTGATC TGA
|
Protein sequence | MNLAPPESVA GEHFDVIIVG SGFGSSFFLH RLLRQPGRRV LVLEWGRHST HDWQLEEGRN SPVTDSDTYA TNSEKPWNFT IGFGGGTNCW FAQTPRLHPA DFRLGSDHGV AQDWPVSYDD LEPYWCEAEE IMAVSGDPDM AQVMPRSRPF PQPPHVMPDP DRLMKAARPD SHFVMPTARA RIATETRAAC CASLRCQICP ADAKFTANNS LVPLYETPGV TLCLETEVRR FEAAGSSISA AVIRGPDGRE HKVTGDLFVL GANAIHSPAI LLRSDLGGGL TGVGLHESYG WSMEAWLDGV DNFGGSTITT GLDFGLYDGP HRKDRGAALV YFENRWSHGM RLGAERMRQT LPLVIVTEDL PEDRNRVTLD GEGRAFIDYH GPSDYALRGM ERAKAALPEL LAPLPVEKIL DHGIRETESH LQGTLRMGSD PATSVVDAGL VHHRLRNLVV VGTSTFPTCS AANPSLTAAA LSLRAADLLI
|
| |
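The sequence fields are displayed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such a field might be cleaned, its GC fraction computed, and its translation checked against the listed protein. All function names are hypothetical. The codon table is built from the standard genetic code; NCBI translation table 11 assigns the same amino acids to codons and differs in the permitted start codons, which this sketch does not model. Although the record lists the strand as "-", the gene sequence as shown begins with ATG and ends with TGA, so the sketch assumes it is already given in coding orientation and does no reverse-complementing.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
cds_start = clean_sequence("ATGAACCTGG CTCCGCCCGA GAGCGTCGCC")
print(translate(cds_start))  # prints "MNLAPPESVA"
print(gc_fraction(cds_start))
```

The translated fragment matches the first block of the listed protein sequence. Running the same functions on the full Gene sequence field would allow the GC content and Protein sequence fields to be compared in the same way.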