Gene Rru_A1547 details

Gene Information

Locus tag: Rru_A1547
Symbol:
ID: 3834962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1822708
End bp: 1824288
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 73%
IMG OID: 637825637
Product: hypothetical protein
Protein accession: YP_426634
Protein GI: 83592882
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
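
The start, end, and length values in this record can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon not counted in the protein length; the function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1822708, 1824288)
print(gene_len, protein_len)  # 1581 526
```

Under these assumptions the computed values agree with the listed gene length (1581 bp) and protein length (526 aa).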


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.831986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCTTA CGCAATCCGG GCCGAGGTTT TGGCAAAGCG CTCTCCCCTT GCCCGGGGAG 
CTGGGGGGGC GGGCGCGTCC GGTGCTTGGC GTTGCCGAGA TGGCGGCGGC CGATCAGGCG
GCGGCCGCCG CCGGCCGGCC GGGGCTTGTC CTGATGGAGG CCGCCGGCGC GGCGGTGGTC
CGCGAGATCG CCGCGCGCTG GTCGAAGCGG CCGGTCCGCG TGTTGTGTGG CCCCGGCAAT
AACGGCGGCG ATGGCTATGT CATCGCCCGG CTTCTGGCGG CGCGCGGCTG GCCGGTGCGG
GTGATGGCCC TTGAGGGGGC GCCGCCCCCG GGCGGTGACG CGGCGGGGAT GGCCCACCTG
TGGCGGGGGC GGGTAGACCC CATGACGGCG GAGGACTTGC GGCCCGGCGA TCTGGTGGTA
GACGCCCTGT TTGGCGCCGG GCTGTCGCGG CCTTTGGCCG GGGCGGCGGC CGAGGCCGTG
GCGCGGATCA ACGCCCTTGG CCTGACCTGC GTCGGCGTTG ATGTGCCAAG CGGCGTCGAT
GGCGACAGCG GACGGATTTT GGGTGCGGCG CCCTTTTGCG CCCTTACGGT CACCTTCTTC
CATCCCAAGC CCGGTCATCT GCTGGTTCCG GCGCGCGAGC GGATCGGCGA GTTGGTGATC
GCCGATATCG GCCTTCCCGA AACGGTTCTA GACGCCGCGC CACCGCGCGC CTTCGTCAAT
GGCCCGGGGC TGTGGACCTT GCCGCGCCCG GCGGTCGAGG GCCATAAATT CGCCCGCGGT
CACGCGGTGG TGATCGGCGG CGCCCGGATG ACCGGGGCGG CGCGGCTGGC GGCGCGGGCC
TGTCGGCGGG TTGGCGCCGG CTTGCTGACC ATCGCCTGCG CCGAGGAAGC GCGGCTGATC
TACGCCCTTG ACCAACCCGG GGCGATGGTT TGGGGGATCG GGGGCGAGGG GCCGATCGCC
CGCCTGCTGG ACGATCCCCG GCGCAACGCC TTTTTGCTGG GGCCGGGCTA TGGACGCGGA
GCCGAAACCG CTGCCCTGGC GTTGATGCTG GCCAAAGGCG GACGCGCCCT GGTTTTGGAT
GCCGATGCCC TGACCAGTCT TTCGGGGAAA CTTGAGGAGT TTTCAAGAAC GCTTTGTTAT
GATTGTGTTC TTACGCCCCA CGAGGGAGAA TTCAGGGCGC TGTTCGCCGC CGCCTTGGGG
GCGGAGGCCG CCCCGGAACG CGGGCGCTTG GCGCGGGCGC GGGCGGCGGC GCGGGCCAGT
GGCGCCGTGG TGGTGCTCAA GGGGCCCGAT ACCGTGATCG CCGCCGCCGA TGGCCGGGCG
GCGATCAGCG TCGGCGCGCC GGCCGATCTG GCGACGGCGG GCAGCGGCGA TGTTCTGGCC
GGGCTGGTGC TTGGCTTGCT GGCCCAGGGA TTGCCCGGCT TCGAAGCGGC GGCGGCGGCG
GTTTGGCTGC ATGGCGCCGC CGGCCGCGCC GCCGGTCCGG GGCTGATCGC CGAGGATCTG
CCCGAAGCCC TGCCCGCGCT CCTCGCCGCG CTCCGCTCCG CTCCCTCGCC GAACCGCCGA
CCCACCGATC GAACCGTCTG A
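
The GC content reported for this gene can be recomputed from the sequence block above. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in 10-base groups separated by whitespace; `gc_fraction` is a hypothetical helper name, and only the first line of the sequence is used here as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; paste the full block to compare
# against the GC content listed in the record.
first_line = "ATGCCGCTTA CGCAATCCGG GCCGAGGTTT TGGCAAAGCG CTCTCCCCTT GCCCGGGGAG"
print(f"{gc_fraction(first_line):.1%}")
```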
 
Protein sequence
MPLTQSGPRF WQSALPLPGE LGGRARPVLG VAEMAAADQA AAAAGRPGLV LMEAAGAAVV 
REIAARWSKR PVRVLCGPGN NGGDGYVIAR LLAARGWPVR VMALEGAPPP GGDAAGMAHL
WRGRVDPMTA EDLRPGDLVV DALFGAGLSR PLAGAAAEAV ARINALGLTC VGVDVPSGVD
GDSGRILGAA PFCALTVTFF HPKPGHLLVP ARERIGELVI ADIGLPETVL DAAPPRAFVN
GPGLWTLPRP AVEGHKFARG HAVVIGGARM TGAARLAARA CRRVGAGLLT IACAEEARLI
YALDQPGAMV WGIGGEGPIA RLLDDPRRNA FLLGPGYGRG AETAALALML AKGGRALVLD
ADALTSLSGK LEEFSRTLCY DCVLTPHEGE FRALFAAALG AEAAPERGRL ARARAAARAS
GAVVVLKGPD TVIAAADGRA AISVGAPADL ATAGSGDVLA GLVLGLLAQG LPGFEAAAAA
VWLHGAAGRA AGPGLIAEDL PEALPALLAA LRSAPSPNRR PTDRTV
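
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The Python sketch below is one way to do this, not the method used to generate this record. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the input is in frame and contains only A, C, G, and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    Raises KeyError on characters other than A, C, G, T.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence listed in this record.
print(translate("ATGCCGCTTA CGCAATCCGG"))    # MPLTQS
print(translate("CCCACCGATC GAACCGTCTG A"))  # PTDRTV
```

Both fragments match the corresponding ends of the listed protein sequence, which begins with MPLTQS and ends with PTDRTV.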