Gene Rru_A3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3107 
Symbol 
ID3836553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3577151 
End bp3578578 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content66% 
IMG OID637827222 
Productglycosyl transferase, group 1 
Protein accessionYP_428189 
Protein GI83594437 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.27638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTGTTC GGGCAAGAAA CGTAAAGGGG AATTCAAATC CCAGACTCAA TTCTTATATT 
ATTCTAGTTA AAGTTTTCGA CGGCCCGGAA ACCCCGGTTT CGCGCCAGTC GCCCCATCTG
ACCCGGGATT CTTGTCAGCA CTGTGATCCA TTCACCCTCA ACCGGGGCGC TATCGTGAAA
ATCCTTTATC ACCACCGAAC CCAGGCCAAC GACGGATGCG CCGTTCATAT CACCGAAATG
ATCGCCGCCC TGCGGCGCGA CGGTCACGAG GTGGTGGTGG TGGCGCCGGC GGTGGCGAAG
GGCGAACCCT CGGCCGAGAA GACCACGGGC GGGCTGATCG CCACTTTGCG CAAAAGACTT
CCCAAGGCCG CCTTTGAAGC TCTGGAGTTC CTGTATTCCG GGTTTGCTTA TTTTCGTTTA
TTGCGCGCGG TGTTTTCCCA CCGCCCCGAC GTTCTTTATG AACGCTACAG CCTTTTCATG
CCGACCGGCA CTTGGATTCG CCGAACCTGC GGCCTGCCGG TTCTCCTCGA GGTCAATTCG
CCTTTGCGCG AGGAGCGCGC CCGGCACGGC GGACTGGCCC TGGGCGCCCT GGCCGGCTGG
ACCGAGCGGG TGTCATGGAA AGGCGCCGAT CGCGTGCTGC CGGTCACCGC CGTGCTCGCC
CGCCAGATCA GCGCCATCGG CGTGGCCGAA GGGCGGATCA GCGTGATCGC CAATGGCATC
AATCCCCAAA CCTTCGGCCC TTTGCCCGAG GGCGACCAAG CCAAGGCCGC CCTTGGTCTG
GAGGGCAAAC TGGTTCTCGG CTTCACCGGC TTCGTCCGCG ATTGGCACGG ATTGGACCGG
GTGATCGAGG CGCTGCCCCG CACCCCCCAG GCCCATCTGC TGATCGTCGG CGACGGCCCG
GCGCGCCAGG ATCTGCTCGC CCGCGCCCAG CAGATGGACG TTGGCGAGCG CGTCAGCTTT
ACCGGCGTGC TGCCCCACGC CCGCATCGCC GGCCATGTCG CCGCCTTCGA TATCGCCCTG
CAGCCGGCGG TCACCGCCTA TGCCTCGCCG CTCAAGCTTT TTGAATACCT TCAGATGGGG
CGGACCATTC TGGCTCCCGA CCAGCCCAAT CTGCGCGAGA TCCTGACCGA TGGCGTGAAC
GCCCGTCTGT TCGACGCCGA GCGCGGCGAA GCCTTCGCCG AAGCGCTCGA CGGCCTGATC
GCCAACCCCG AGGAGCGCCG CCGTCTGGCC GAAGGCGCCC GGGCGACGAT CGCCCGCCTT
GGCTTGACCT GGGATCACAA CGCCCGCCGG GTCGTCGCCC TCGCCCAAGC CGCGCTTCTC
GCCTGCCCCC GCCGGCGCCC CGCCGCCGGA ACGGCAGGGC CCCTTGGTCC GACGGCGGCC
CGCCCTTCCT CCCGGGCAGC GGACGATCCC CCCGGCGCGC CGCGGTAG
 
Protein sequence
MSVRARNVKG NSNPRLNSYI ILVKVFDGPE TPVSRQSPHL TRDSCQHCDP FTLNRGAIVK 
ILYHHRTQAN DGCAVHITEM IAALRRDGHE VVVVAPAVAK GEPSAEKTTG GLIATLRKRL
PKAAFEALEF LYSGFAYFRL LRAVFSHRPD VLYERYSLFM PTGTWIRRTC GLPVLLEVNS
PLREERARHG GLALGALAGW TERVSWKGAD RVLPVTAVLA RQISAIGVAE GRISVIANGI
NPQTFGPLPE GDQAKAALGL EGKLVLGFTG FVRDWHGLDR VIEALPRTPQ AHLLIVGDGP
ARQDLLARAQ QMDVGERVSF TGVLPHARIA GHVAAFDIAL QPAVTAYASP LKLFEYLQMG
RTILAPDQPN LREILTDGVN ARLFDAERGE AFAEALDGLI ANPEERRRLA EGARATIARL
GLTWDHNARR VVALAQAALL ACPRRRPAAG TAGPLGPTAA RPSSRAADDP PGAPR