Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1547 |
Symbol | |
ID | 3834962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 1822708 |
End bp | 1824288 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637825637 |
Product | hypothetical protein |
Protein accession | YP_426634 |
Protein GI | 83592882 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.831986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTTA CGCAATCCGG GCCGAGGTTT TGGCAAAGCG CTCTCCCCTT GCCCGGGGAG CTGGGGGGGC GGGCGCGTCC GGTGCTTGGC GTTGCCGAGA TGGCGGCGGC CGATCAGGCG GCGGCCGCCG CCGGCCGGCC GGGGCTTGTC CTGATGGAGG CCGCCGGCGC GGCGGTGGTC CGCGAGATCG CCGCGCGCTG GTCGAAGCGG CCGGTCCGCG TGTTGTGTGG CCCCGGCAAT AACGGCGGCG ATGGCTATGT CATCGCCCGG CTTCTGGCGG CGCGCGGCTG GCCGGTGCGG GTGATGGCCC TTGAGGGGGC GCCGCCCCCG GGCGGTGACG CGGCGGGGAT GGCCCACCTG TGGCGGGGGC GGGTAGACCC CATGACGGCG GAGGACTTGC GGCCCGGCGA TCTGGTGGTA GACGCCCTGT TTGGCGCCGG GCTGTCGCGG CCTTTGGCCG GGGCGGCGGC CGAGGCCGTG GCGCGGATCA ACGCCCTTGG CCTGACCTGC GTCGGCGTTG ATGTGCCAAG CGGCGTCGAT GGCGACAGCG GACGGATTTT GGGTGCGGCG CCCTTTTGCG CCCTTACGGT CACCTTCTTC CATCCCAAGC CCGGTCATCT GCTGGTTCCG GCGCGCGAGC GGATCGGCGA GTTGGTGATC GCCGATATCG GCCTTCCCGA AACGGTTCTA GACGCCGCGC CACCGCGCGC CTTCGTCAAT GGCCCGGGGC TGTGGACCTT GCCGCGCCCG GCGGTCGAGG GCCATAAATT CGCCCGCGGT CACGCGGTGG TGATCGGCGG CGCCCGGATG ACCGGGGCGG CGCGGCTGGC GGCGCGGGCC TGTCGGCGGG TTGGCGCCGG CTTGCTGACC ATCGCCTGCG CCGAGGAAGC GCGGCTGATC TACGCCCTTG ACCAACCCGG GGCGATGGTT TGGGGGATCG GGGGCGAGGG GCCGATCGCC CGCCTGCTGG ACGATCCCCG GCGCAACGCC TTTTTGCTGG GGCCGGGCTA TGGACGCGGA GCCGAAACCG CTGCCCTGGC GTTGATGCTG GCCAAAGGCG GACGCGCCCT GGTTTTGGAT GCCGATGCCC TGACCAGTCT TTCGGGGAAA CTTGAGGAGT TTTCAAGAAC GCTTTGTTAT GATTGTGTTC TTACGCCCCA CGAGGGAGAA TTCAGGGCGC TGTTCGCCGC CGCCTTGGGG GCGGAGGCCG CCCCGGAACG CGGGCGCTTG GCGCGGGCGC GGGCGGCGGC GCGGGCCAGT GGCGCCGTGG TGGTGCTCAA GGGGCCCGAT ACCGTGATCG CCGCCGCCGA TGGCCGGGCG GCGATCAGCG TCGGCGCGCC GGCCGATCTG GCGACGGCGG GCAGCGGCGA TGTTCTGGCC GGGCTGGTGC TTGGCTTGCT GGCCCAGGGA TTGCCCGGCT TCGAAGCGGC GGCGGCGGCG GTTTGGCTGC ATGGCGCCGC CGGCCGCGCC GCCGGTCCGG GGCTGATCGC CGAGGATCTG CCCGAAGCCC TGCCCGCGCT CCTCGCCGCG CTCCGCTCCG CTCCCTCGCC GAACCGCCGA CCCACCGATC GAACCGTCTG A
|
Protein sequence | MPLTQSGPRF WQSALPLPGE LGGRARPVLG VAEMAAADQA AAAAGRPGLV LMEAAGAAVV REIAARWSKR PVRVLCGPGN NGGDGYVIAR LLAARGWPVR VMALEGAPPP GGDAAGMAHL WRGRVDPMTA EDLRPGDLVV DALFGAGLSR PLAGAAAEAV ARINALGLTC VGVDVPSGVD GDSGRILGAA PFCALTVTFF HPKPGHLLVP ARERIGELVI ADIGLPETVL DAAPPRAFVN GPGLWTLPRP AVEGHKFARG HAVVIGGARM TGAARLAARA CRRVGAGLLT IACAEEARLI YALDQPGAMV WGIGGEGPIA RLLDDPRRNA FLLGPGYGRG AETAALALML AKGGRALVLD ADALTSLSGK LEEFSRTLCY DCVLTPHEGE FRALFAAALG AEAAPERGRL ARARAAARAS GAVVVLKGPD TVIAAADGRA AISVGAPADL ATAGSGDVLA GLVLGLLAQG LPGFEAAAAA VWLHGAAGRA AGPGLIAEDL PEALPALLAA LRSAPSPNRR PTDRTV
|
| |