Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_B0044 |
Symbol | |
ID | 3833375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007641 |
Strand | + |
Start bp | 44416 |
End bp | 46656 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637824063 |
Product | glycosyl transferase family protein |
Protein accession | YP_425080 |
Protein GI | 83582774 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACTC TCCCCACCCC GCCGCGCCCC CGCCCCGTCG ATATCATCAT TCCGGTCTAT AAGGGGCTGG AGGAAACCAG GCTGTGCCTG GAGAGCGTTC TGGCGACGCT TCCCGCCGCC GACGGTCTCA TCGTTATCGA TGACCACAGC CCCGACCCCG CCCTGGTCGC CTATCTGAAG GACCGCGCGG CCGGCGATCG GCGGATAAGA CTGCTCCACA ACCCCGAAAA CCTTGGTTTC GTCGGCACCG TCAACCGGGG CATGGCGTTG GAGCCCGAGC GCGACGTCCT GCTGCTCAAC AGCGATACCG AGGTTGCCGG CGACTGGGTG GCGCGGCTGC GCGCGGCGGC CTATGCCGAT CGGCGGATCG GCACGGTCAC GCCCTTTTCC AATAACGCGA CGATCTGCAG CTGGCCCCGG TTCTGCCAGG ACAATCCTTT GCCCCCCGGG CTGGACGTGG CCGGGGTCGA CAGGGCCTTC GCCGCCGCCA ATCCCGGACT GAGCATCGAT ATTCCAACCG GGGTGGGCTT CGCCTTTTAC ATCCGCCGCG ATTGCCTTGA TGAGGTCGGC CTGTTCAATG CCGAGGCCTT CGGCAAGGGC TATGGCGAGG AAAACGACTT CTGCCGCCGG GCCCATCACC GGGGGTGGCG CAACGTGCTC GCCGCCGACA CCTTCGTTTA TCACTCGGGC AACGTCAGTT TCGGGGTCAA TCAGGAGCGT CTGGATAGCG CCATGCGCCA GTTGCTGGCC CTTCATCCCG ATTATCGCCG GGTGGTCCAG CTCCATATCA ACCAGGATCC GGCCCAAAAC ATGCGCTGGC GCGCCGCCCT CGGCATCCTG CGCCAAAGCG CCTTGCCGGT GCTGCTGTTC GTTACCCATA ACCATGGCGG CGGCACCCTG AGCCATGTCC ACGAACTGGC GAAGGCCCTT GAAGGGCGGG CCTGGGGGCT TTTGCTGACG CCGGGGCCGC GTAATACCGC CGTTGTGACC TTACCCGCTT CCCTGGGCGG CGACGCCCTG CCCTTCGATC TGGAGCAAGA CTGGGACGGG CTGCTTGATC TGTTGCGCTA TGCGGGGGTG AACCGCCTGC ATCTTCACCA TATGCTGGGC GTTCCCGAAC GGCTTTTGGA TCTGCCCGAA CAGCTTTCCA TTCCCTTTGA CTTCACCGCC CATGATTTTC ACGCTGGCTG TCCGCGGGTC ATGCTCTGCG GTCCAGGCTC CCGCTATTGC GGCCAGCCCG AAGAGCGGGC GCTCTGCGAT GCTTGTCTGG CCCAGGCGCC GAAGACCGAA GCCGGCGATA TCACGTCTTG GCGCGCGGCG ATGGTCGCCC GTTTGAGCCG GGCCGAGCGG TTGTTCGCGC CCAGCGCCGA TACCGCGAAC CGCCTGAAGC GGATGCTGCC CGCCCTCTCC TTTCGCGCCA TCCCCCATCC CGACGCCCAG GGCCTCGCCA CCAACGCGCC GCCCCCTCTC CTGCGTCCCT GTGGGACGAC CGAACCCTTG CGTATCCTGG TGCTTGGCGC CCTGAGCCGG GCCAAGGGCG CCGATCTGGT CGAAGCGACG GCACGCGAGG CGGCCCGGGG CGATCTTCCC TTGGAAATCC ACCTTCTCGG CTATGGCTAC CGACCGCTGC ATCGCGCGCG GGGGCGACTG ACCGCCCATG GCCGCTACCA CCCCGAGGAG ATCGCCGGGC ATTTGGAGCG CATCGCTCCC CATGTGGCTT GGCTGCCGGC CGGTTGGCCG GAAACCTACA GCTACACTTT GAGCGAGGTC ATGGCCGCCG GATTGCCGGT GGTGGTCAGC GATCTTGGCG CCCCACCAGA GCGCATCCTC GGCCGCCCTT TGTCCTGGGT TCTGCCCTGG AACGTCGATG CTTCCACAGC GGCGGCGTTC TTCGGGAGGC TGCGCGCCGG GGAAATCCCC GCCTCCCCGG AGGCCCCTGC CCTCTCGCCG CAAAGCACGC CCCGGGTGGA TTTTTATCGC GAAGGCTATC TTGCCGAGGT TTTTCCGGAG AAAGCACGCG CGCGCCTTCC AGAGCGGGAA ATCGGCGAGC TTATTGGGCA AGCCCTCGAT CGCGCCGGGG CGCGCCGGCG ACGCCTACAG TGTTGGGGCG ATCTGAAAGT CGTGCGGCAA CGGATCTGGC GCTCTTTGAT TACCCTTTTG GCCCACCCCA GCCTCTTCCC TTTGGTCAGT CGTATTCCGG TTTCTTTCCA GGAACGGTTG AAGCGCCTAA TTAAGGGGTA G
|
Protein sequence | MSTLPTPPRP RPVDIIIPVY KGLEETRLCL ESVLATLPAA DGLIVIDDHS PDPALVAYLK DRAAGDRRIR LLHNPENLGF VGTVNRGMAL EPERDVLLLN SDTEVAGDWV ARLRAAAYAD RRIGTVTPFS NNATICSWPR FCQDNPLPPG LDVAGVDRAF AAANPGLSID IPTGVGFAFY IRRDCLDEVG LFNAEAFGKG YGEENDFCRR AHHRGWRNVL AADTFVYHSG NVSFGVNQER LDSAMRQLLA LHPDYRRVVQ LHINQDPAQN MRWRAALGIL RQSALPVLLF VTHNHGGGTL SHVHELAKAL EGRAWGLLLT PGPRNTAVVT LPASLGGDAL PFDLEQDWDG LLDLLRYAGV NRLHLHHMLG VPERLLDLPE QLSIPFDFTA HDFHAGCPRV MLCGPGSRYC GQPEERALCD ACLAQAPKTE AGDITSWRAA MVARLSRAER LFAPSADTAN RLKRMLPALS FRAIPHPDAQ GLATNAPPPL LRPCGTTEPL RILVLGALSR AKGADLVEAT AREAARGDLP LEIHLLGYGY RPLHRARGRL TAHGRYHPEE IAGHLERIAP HVAWLPAGWP ETYSYTLSEV MAAGLPVVVS DLGAPPERIL GRPLSWVLPW NVDASTAAAF FGRLRAGEIP ASPEAPALSP QSTPRVDFYR EGYLAEVFPE KARARLPERE IGELIGQALD RAGARRRRLQ CWGDLKVVRQ RIWRSLITLL AHPSLFPLVS RIPVSFQERL KRLIKG
|
| |