Gene Rru_A1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1970 
Symbol 
ID3835394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2277487 
End bp2279214 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content68% 
IMG OID637826069 
ProductPTS fructose IIC component 
Protein accessionYP_427057 
Protein GI83593305 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1299] Phosphotransferase system, fructose-specific IIC component
[COG1445] Phosphotransferase system fructose-specific component IIB 
TIGRFAM ID[TIGR00829] PTS system, fructose-specific, IIB component
[TIGR01427] PTS system, fructose subfamily, IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.239904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGT TTATCGCGGT CGTCGGCGGG GGCGAGCGGA GCACCCAGGC GCTGCTGGTC 
GCCGAGGCCC TGCGCCGGGC CGCCAGCGGG GCGGGGCACC GCATCGATAT CGAGGTGCGC
AGCGATCAGG GCGTGGTCAA CGCCTTGAGC GACGACGCCA TCGTGCGCGC CAGGGCGGTG
ATCCTGATCG GCACCGGCGA TCTGGATGTC GGGCGCTTTC CCGGGTTGCC CAAGCTTGAA
ACCACCATCG AGGCGGTGCT GGCCGATGTG GGGGCGGTTC TCGGGCGGGC CGAGGGGGCC
GAATTGGCCG GAAGCGTCGC CGCCACTGAC GACGGGGTGC GGCGCATCGT CGCCATCACC
TCCTGCCCGA CCGGCATCGC CCATACCTTC ATGGCCGCCG AGGGCATCAC CCAGGCGGCC
AAGGCTTTGG GCTATCAGGC CAGGGTCGAG ACCCAGGGCT CGGTGGGATC GCGCGACACC
TTAAGCGACG ACGAGATCGC CCAGGCCGAT ATCGTGCTGA TCGCCGCCGA TACCCAGATC
GATCTGGCGC GCTTCGACGG CAAGCGGGTG TTCCTGTCGG GCACCAAGCC GGCGATCGCC
GATGGCAAGG CGCTGATCGC CCGCGCCCTG GCCGAGGCGA CGCTGCAAGG CGGCAAGAAA
TCCCTGGTCG ATACGGTGGA GGCGGGCAAG GCCCAGCGCT CGGCCCAGCG CACGGGGGCC
TATAAGCACC TGATGACCGG CGTGTCCTTC ATGCTGCCCT TCGTCGTCGC CGGCGGGTTG
CTGATCGCCC TGGCCTTCGC CTTGGGCGGC ATCAACGCCT TTGACGAGGC CAATAAGGGA
ACGCTGGCCT ATGACCTGTT CCAGATCGGC GCCAAAAGCG CCTTCGTGCT GATGGTTCCG
GCCCTGGCCG GCTATATCGC CTTTTCCATC GCCGACCGGC CGGGCATCGC CCCGGGCATG
ATCGGCGGTC TGGTCGCGGC CAATCTGGGG GCGGGTTTCC TGGGCGGCAT CATCGCCGGC
TTCATCGCCG GCTATGTCAC GGCTTGGTTC AATCGCACCA TCCGGCTGCA CCGCAATCTT
GAAGGGTTGA AGCCGGTCCT GATCCTGCCC GTGCTGGGCA CCGTCGTGAC CGGCTTGCTG
ATGATCTATG TCGTCGGCAC GCCGGTGGCC GGGCTGTTGT CGTGGTTGAC CGAGGCCTTG
CGCGGCATGC AGGGCAGCAG CGCCATCGTG CTCGGGCTGT TGATCGGGGC GATGATGGCC
TTCGATATGG GCGGCCCGGT CAATAAGGCG GCCTATGCCT TTTCCACCGG CCTGCTGGCC
AGCGAGGTCT ATACGCCGAT GGCGGCGGCC ATGGTGGCGG GGATGACCCC GGCCCTGGGC
GTGGCCCTGG CCAGCCGGCT GTTTCGCAGC CGCTTCACCG CCAGCGAACG CGAGGCCGGC
GGGGCGGCGG CGGTGCTGGG GCTGGCCTTC ATCACCGAAG GCGCCATTCC CTTCGCTGCC
TCCGATCCCT TCCGCGTCAT TCCGGCGCTG ATGGCCGGAT CGGCGGCGGC CGGGGCGATC
TCGATGACGG TGGGAGCGGA GCTGAAAGTC CCCCACGGCG GCATTTTCGT GCTGCCGATC
CCCAATGCCG TCACCCATTT GCTGGGCTAT ATCGTCGCCC TGGTCGTTGG CACCCTGATC
ACCGCCCTGG TGCTGCGGAT CACCAAACGC CCGGTGGCTG TTCTTTAG
 
Protein sequence
MSQFIAVVGG GERSTQALLV AEALRRAASG AGHRIDIEVR SDQGVVNALS DDAIVRARAV 
ILIGTGDLDV GRFPGLPKLE TTIEAVLADV GAVLGRAEGA ELAGSVAATD DGVRRIVAIT
SCPTGIAHTF MAAEGITQAA KALGYQARVE TQGSVGSRDT LSDDEIAQAD IVLIAADTQI
DLARFDGKRV FLSGTKPAIA DGKALIARAL AEATLQGGKK SLVDTVEAGK AQRSAQRTGA
YKHLMTGVSF MLPFVVAGGL LIALAFALGG INAFDEANKG TLAYDLFQIG AKSAFVLMVP
ALAGYIAFSI ADRPGIAPGM IGGLVAANLG AGFLGGIIAG FIAGYVTAWF NRTIRLHRNL
EGLKPVLILP VLGTVVTGLL MIYVVGTPVA GLLSWLTEAL RGMQGSSAIV LGLLIGAMMA
FDMGGPVNKA AYAFSTGLLA SEVYTPMAAA MVAGMTPALG VALASRLFRS RFTASEREAG
GAAAVLGLAF ITEGAIPFAA SDPFRVIPAL MAGSAAAGAI SMTVGAELKV PHGGIFVLPI
PNAVTHLLGY IVALVVGTLI TALVLRITKR PVAVL