Gene Information |
Locus tag | Rru_A1970 |
Symbol | |
ID | 3835394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 2277487 |
End bp | 2279214 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637826069 |
Product | PTS fructose IIC component |
Protein accession | YP_427057 |
Protein GI | 83593305 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component; [COG1445] Phosphotransferase system fructose-specific component IIB |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component; [TIGR01427] PTS system, fructose subfamily, IIC component |
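The coordinate and length fields in the Gene Information table are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2277487
END_BP = 2279214


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                     # 1728
print(protein_length_aa(length_bp))  # 575
```

Both results match the Gene Length (1728 bp) and Protein Length (575 aa) fields of the record.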
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.239904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGT TTATCGCGGT CGTCGGCGGG GGCGAGCGGA GCACCCAGGC GCTGCTGGTC GCCGAGGCCC TGCGCCGGGC CGCCAGCGGG GCGGGGCACC GCATCGATAT CGAGGTGCGC AGCGATCAGG GCGTGGTCAA CGCCTTGAGC GACGACGCCA TCGTGCGCGC CAGGGCGGTG ATCCTGATCG GCACCGGCGA TCTGGATGTC GGGCGCTTTC CCGGGTTGCC CAAGCTTGAA ACCACCATCG AGGCGGTGCT GGCCGATGTG GGGGCGGTTC TCGGGCGGGC CGAGGGGGCC GAATTGGCCG GAAGCGTCGC CGCCACTGAC GACGGGGTGC GGCGCATCGT CGCCATCACC TCCTGCCCGA CCGGCATCGC CCATACCTTC ATGGCCGCCG AGGGCATCAC CCAGGCGGCC AAGGCTTTGG GCTATCAGGC CAGGGTCGAG ACCCAGGGCT CGGTGGGATC GCGCGACACC TTAAGCGACG ACGAGATCGC CCAGGCCGAT ATCGTGCTGA TCGCCGCCGA TACCCAGATC GATCTGGCGC GCTTCGACGG CAAGCGGGTG TTCCTGTCGG GCACCAAGCC GGCGATCGCC GATGGCAAGG CGCTGATCGC CCGCGCCCTG GCCGAGGCGA CGCTGCAAGG CGGCAAGAAA TCCCTGGTCG ATACGGTGGA GGCGGGCAAG GCCCAGCGCT CGGCCCAGCG CACGGGGGCC TATAAGCACC TGATGACCGG CGTGTCCTTC ATGCTGCCCT TCGTCGTCGC CGGCGGGTTG CTGATCGCCC TGGCCTTCGC CTTGGGCGGC ATCAACGCCT TTGACGAGGC CAATAAGGGA ACGCTGGCCT ATGACCTGTT CCAGATCGGC GCCAAAAGCG CCTTCGTGCT GATGGTTCCG GCCCTGGCCG GCTATATCGC CTTTTCCATC GCCGACCGGC CGGGCATCGC CCCGGGCATG ATCGGCGGTC TGGTCGCGGC CAATCTGGGG GCGGGTTTCC TGGGCGGCAT CATCGCCGGC TTCATCGCCG GCTATGTCAC GGCTTGGTTC AATCGCACCA TCCGGCTGCA CCGCAATCTT GAAGGGTTGA AGCCGGTCCT GATCCTGCCC GTGCTGGGCA CCGTCGTGAC CGGCTTGCTG ATGATCTATG TCGTCGGCAC GCCGGTGGCC GGGCTGTTGT CGTGGTTGAC CGAGGCCTTG CGCGGCATGC AGGGCAGCAG CGCCATCGTG CTCGGGCTGT TGATCGGGGC GATGATGGCC TTCGATATGG GCGGCCCGGT CAATAAGGCG GCCTATGCCT TTTCCACCGG CCTGCTGGCC AGCGAGGTCT ATACGCCGAT GGCGGCGGCC ATGGTGGCGG GGATGACCCC GGCCCTGGGC GTGGCCCTGG CCAGCCGGCT GTTTCGCAGC CGCTTCACCG CCAGCGAACG CGAGGCCGGC GGGGCGGCGG CGGTGCTGGG GCTGGCCTTC ATCACCGAAG GCGCCATTCC CTTCGCTGCC TCCGATCCCT TCCGCGTCAT TCCGGCGCTG ATGGCCGGAT CGGCGGCGGC CGGGGCGATC TCGATGACGG TGGGAGCGGA GCTGAAAGTC CCCCACGGCG GCATTTTCGT GCTGCCGATC CCCAATGCCG TCACCCATTT GCTGGGCTAT ATCGTCGCCC TGGTCGTTGG CACCCTGATC ACCGCCCTGG TGCTGCGGAT CACCAAACGC CCGGTGGCTG TTCTTTAG
|
Protein sequence | MSQFIAVVGG GERSTQALLV AEALRRAASG AGHRIDIEVR SDQGVVNALS DDAIVRARAV ILIGTGDLDV GRFPGLPKLE TTIEAVLADV GAVLGRAEGA ELAGSVAATD DGVRRIVAIT SCPTGIAHTF MAAEGITQAA KALGYQARVE TQGSVGSRDT LSDDEIAQAD IVLIAADTQI DLARFDGKRV FLSGTKPAIA DGKALIARAL AEATLQGGKK SLVDTVEAGK AQRSAQRTGA YKHLMTGVSF MLPFVVAGGL LIALAFALGG INAFDEANKG TLAYDLFQIG AKSAFVLMVP ALAGYIAFSI ADRPGIAPGM IGGLVAANLG AGFLGGIIAG FIAGYVTAWF NRTIRLHRNL EGLKPVLILP VLGTVVTGLL MIYVVGTPVA GLLSWLTEAL RGMQGSSAIV LGLLIGAMMA FDMGGPVNKA AYAFSTGLLA SEVYTPMAAA MVAGMTPALG VALASRLFRS RFTASEREAG GAAAVLGLAF ITEGAIPFAA SDPFRVIPAL MAGSAAAGAI SMTVGAELKV PHGGIFVLPI PNAVTHLLGY IVALVVGTLI TALVLRITKR PVAVL
|
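As a rough consistency check between the two sequence fields, the sketch below translates the first 30 and last 18 nucleotides of the gene sequence and compares them with the ends of the protein sequence. It assumes the gene sequence is already given in the coding (5' to 3') orientation, which the leading ATG and trailing TAG suggest. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons; since this gene starts with ATG, a plain codon lookup is sufficient here. The helper names (clean, translate, gc_fraction) are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 18 nucleotides, copied from the Gene sequence field.
FIRST_30 = "ATGAGCCAGT TTATCGCGGT CGTCGGCGGG"
LAST_18 = "CCGGTGGCTG TTCTTTAG"

print(translate(FIRST_30))  # MSQFIAVVGG
print(translate(LAST_18))   # PVAVL*
```

The two outputs agree with the first ten residues (MSQFIAVVGG) and the last five residues (PVAVL) of the protein sequence. Passing the full gene sequence to gc_fraction would give the value to compare with the GC content field.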