Gene Rru_B0044 details

Gene Information

Locus tag: Rru_B0044
Symbol:
ID: 3833375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007641
Strand:
Start bp: 44416
End bp: 46656
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 66%
IMG OID: 637824063
Product: glycosyl transferase family protein
Protein accession: YP_425080
Protein GI: 83582774
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
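
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(44416, 46656)
print(length)                     # 2241
print(protein_length_aa(length))  # 746
```

Under those assumptions, 46656 - 44416 + 1 gives the listed 2241 bp, and 2241 / 3 - 1 gives the listed 746 aa.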


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACTC TCCCCACCCC GCCGCGCCCC CGCCCCGTCG ATATCATCAT TCCGGTCTAT 
AAGGGGCTGG AGGAAACCAG GCTGTGCCTG GAGAGCGTTC TGGCGACGCT TCCCGCCGCC
GACGGTCTCA TCGTTATCGA TGACCACAGC CCCGACCCCG CCCTGGTCGC CTATCTGAAG
GACCGCGCGG CCGGCGATCG GCGGATAAGA CTGCTCCACA ACCCCGAAAA CCTTGGTTTC
GTCGGCACCG TCAACCGGGG CATGGCGTTG GAGCCCGAGC GCGACGTCCT GCTGCTCAAC
AGCGATACCG AGGTTGCCGG CGACTGGGTG GCGCGGCTGC GCGCGGCGGC CTATGCCGAT
CGGCGGATCG GCACGGTCAC GCCCTTTTCC AATAACGCGA CGATCTGCAG CTGGCCCCGG
TTCTGCCAGG ACAATCCTTT GCCCCCCGGG CTGGACGTGG CCGGGGTCGA CAGGGCCTTC
GCCGCCGCCA ATCCCGGACT GAGCATCGAT ATTCCAACCG GGGTGGGCTT CGCCTTTTAC
ATCCGCCGCG ATTGCCTTGA TGAGGTCGGC CTGTTCAATG CCGAGGCCTT CGGCAAGGGC
TATGGCGAGG AAAACGACTT CTGCCGCCGG GCCCATCACC GGGGGTGGCG CAACGTGCTC
GCCGCCGACA CCTTCGTTTA TCACTCGGGC AACGTCAGTT TCGGGGTCAA TCAGGAGCGT
CTGGATAGCG CCATGCGCCA GTTGCTGGCC CTTCATCCCG ATTATCGCCG GGTGGTCCAG
CTCCATATCA ACCAGGATCC GGCCCAAAAC ATGCGCTGGC GCGCCGCCCT CGGCATCCTG
CGCCAAAGCG CCTTGCCGGT GCTGCTGTTC GTTACCCATA ACCATGGCGG CGGCACCCTG
AGCCATGTCC ACGAACTGGC GAAGGCCCTT GAAGGGCGGG CCTGGGGGCT TTTGCTGACG
CCGGGGCCGC GTAATACCGC CGTTGTGACC TTACCCGCTT CCCTGGGCGG CGACGCCCTG
CCCTTCGATC TGGAGCAAGA CTGGGACGGG CTGCTTGATC TGTTGCGCTA TGCGGGGGTG
AACCGCCTGC ATCTTCACCA TATGCTGGGC GTTCCCGAAC GGCTTTTGGA TCTGCCCGAA
CAGCTTTCCA TTCCCTTTGA CTTCACCGCC CATGATTTTC ACGCTGGCTG TCCGCGGGTC
ATGCTCTGCG GTCCAGGCTC CCGCTATTGC GGCCAGCCCG AAGAGCGGGC GCTCTGCGAT
GCTTGTCTGG CCCAGGCGCC GAAGACCGAA GCCGGCGATA TCACGTCTTG GCGCGCGGCG
ATGGTCGCCC GTTTGAGCCG GGCCGAGCGG TTGTTCGCGC CCAGCGCCGA TACCGCGAAC
CGCCTGAAGC GGATGCTGCC CGCCCTCTCC TTTCGCGCCA TCCCCCATCC CGACGCCCAG
GGCCTCGCCA CCAACGCGCC GCCCCCTCTC CTGCGTCCCT GTGGGACGAC CGAACCCTTG
CGTATCCTGG TGCTTGGCGC CCTGAGCCGG GCCAAGGGCG CCGATCTGGT CGAAGCGACG
GCACGCGAGG CGGCCCGGGG CGATCTTCCC TTGGAAATCC ACCTTCTCGG CTATGGCTAC
CGACCGCTGC ATCGCGCGCG GGGGCGACTG ACCGCCCATG GCCGCTACCA CCCCGAGGAG
ATCGCCGGGC ATTTGGAGCG CATCGCTCCC CATGTGGCTT GGCTGCCGGC CGGTTGGCCG
GAAACCTACA GCTACACTTT GAGCGAGGTC ATGGCCGCCG GATTGCCGGT GGTGGTCAGC
GATCTTGGCG CCCCACCAGA GCGCATCCTC GGCCGCCCTT TGTCCTGGGT TCTGCCCTGG
AACGTCGATG CTTCCACAGC GGCGGCGTTC TTCGGGAGGC TGCGCGCCGG GGAAATCCCC
GCCTCCCCGG AGGCCCCTGC CCTCTCGCCG CAAAGCACGC CCCGGGTGGA TTTTTATCGC
GAAGGCTATC TTGCCGAGGT TTTTCCGGAG AAAGCACGCG CGCGCCTTCC AGAGCGGGAA
ATCGGCGAGC TTATTGGGCA AGCCCTCGAT CGCGCCGGGG CGCGCCGGCG ACGCCTACAG
TGTTGGGGCG ATCTGAAAGT CGTGCGGCAA CGGATCTGGC GCTCTTTGAT TACCCTTTTG
GCCCACCCCA GCCTCTTCCC TTTGGTCAGT CGTATTCCGG TTTCTTTCCA GGAACGGTTG
AAGCGCCTAA TTAAGGGGTA G
 
Protein sequence
MSTLPTPPRP RPVDIIIPVY KGLEETRLCL ESVLATLPAA DGLIVIDDHS PDPALVAYLK 
DRAAGDRRIR LLHNPENLGF VGTVNRGMAL EPERDVLLLN SDTEVAGDWV ARLRAAAYAD
RRIGTVTPFS NNATICSWPR FCQDNPLPPG LDVAGVDRAF AAANPGLSID IPTGVGFAFY
IRRDCLDEVG LFNAEAFGKG YGEENDFCRR AHHRGWRNVL AADTFVYHSG NVSFGVNQER
LDSAMRQLLA LHPDYRRVVQ LHINQDPAQN MRWRAALGIL RQSALPVLLF VTHNHGGGTL
SHVHELAKAL EGRAWGLLLT PGPRNTAVVT LPASLGGDAL PFDLEQDWDG LLDLLRYAGV
NRLHLHHMLG VPERLLDLPE QLSIPFDFTA HDFHAGCPRV MLCGPGSRYC GQPEERALCD
ACLAQAPKTE AGDITSWRAA MVARLSRAER LFAPSADTAN RLKRMLPALS FRAIPHPDAQ
GLATNAPPPL LRPCGTTEPL RILVLGALSR AKGADLVEAT AREAARGDLP LEIHLLGYGY
RPLHRARGRL TAHGRYHPEE IAGHLERIAP HVAWLPAGWP ETYSYTLSEV MAAGLPVVVS
DLGAPPERIL GRPLSWVLPW NVDASTAAAF FGRLRAGEIP ASPEAPALSP QSTPRVDFYR
EGYLAEVFPE KARARLPERE IGELIGQALD RAGARRRRLQ CWGDLKVVRQ RIWRSLITLL
AHPSLFPLVS RIPVSFQERL KRLIKG
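
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that whitespace, computes a GC fraction, and translates a nucleotide string codon by codon. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of the source database.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# assumed to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in the record.
first_line = "ATGTCCACTC TCCCCACCCC GCCGCGCCCC CGCCCCGTCG ATATCATCAT TCCGGTCTAT"
print(translate(first_line))  # MSTLPTPPRPRPVDIIIPVY
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the whole gene sequence to `gc_fraction` gives a value to compare with the reported GC content.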