Gene Rru_A3721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3721 
Symbol 
ID3837178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp4271848 
End bp4272936 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content65% 
IMG OID637827846 
Productsugar transferase 
Protein accessionYP_428802 
Protein GI83595050 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.243024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCCG CAGAGACATC CTTGCACGCC TTGGCCGAAG ACAAGCGCGG CACGAGGCGC 
CCGTCCTCGC CGTCCCGCCT TCCTAGGCCG ATGCGCGTCG CCCTGGTCGG CGCCGGTCTT
CCCGCCGCCG CCCTGATCGA ATTCCTCCAA TCCGACGACG CCCGCGACGA TTACACCGTC
GTGGCGATCT TCGATGAACG CGGCGATCGC CGGCCACCGG TGTTGGGTGC CAAACCGGTG
GAAAAGGGCC TGAGCGGCCT GCGCGACCTT GCCGAGGCCG GCAAGATCGA CGCCATTCTT
CTGACGCTCT ACGGCGCCTC CTCGGCGCGC ATGTTCGAGA TCATCGAGCG CATCGGCACC
ACCGCCGTCG ATATCTATCT GCCGCGCGAG CACAAAGATA AGCACTTCTC GTGGACCAGC
TATCACCTGA TCGGCGGCCT GCCGTTCCTG TGCATCAAGG GGCGGCCCTT CCAGGGGTTC
GCCGGCGTTT TCAAAAGGAT CGAGGATTAT ACCCTGGCGG TTCTGGCGCT GACGCTGATC
GGCCCGATCC TGCTGCTGGC CATGCTGGCG ATCCGCCTGG AAAGCCCCGG CCCGGCCCTG
ATCCATCAGC GGCGCATCGG CCTTGGCGGG AAGCTGTTTT CCATGCTCAA GTTGCGCTCC
ATGCGCTTCG ATCCCAATGA CGACGGGCGG AACGGCGCCA TCGCCAATGA TCCGCGCATC
ACCCGGGTCG GCGCCTTCCT GCGCGCCACC AGCATCGACG AGCTGCCCCA GGTCCTCAAT
GTCCTGCGCG GCGACATGTC GATGATCGGC CCGCGTCCCC ATGTTCCCAA CATGCTGGTC
GAAAACACCG TTTACGGGGT GAGTGTCAGC GAATACGTCG CCCGCCACCG GGTGCGGCCG
GGGATCACCG GTTGGGCCCA GGTGAATGGC ATGCGCGGCG GCATTCATGA TATCGAAAAG
GCCCGGCGCG GCGCGCAACT GGATATCTAT TACATCGAGA ACTGGACACC ATGGCTCGAC
ATCAAGATCC TCTGGCGGAC GATTTTCGGC GGCCTGCGCG ACCCCTCCGC TCTGCGCTCG
CCGGGCTGA
 
Protein sequence
MASAETSLHA LAEDKRGTRR PSSPSRLPRP MRVALVGAGL PAAALIEFLQ SDDARDDYTV 
VAIFDERGDR RPPVLGAKPV EKGLSGLRDL AEAGKIDAIL LTLYGASSAR MFEIIERIGT
TAVDIYLPRE HKDKHFSWTS YHLIGGLPFL CIKGRPFQGF AGVFKRIEDY TLAVLALTLI
GPILLLAMLA IRLESPGPAL IHQRRIGLGG KLFSMLKLRS MRFDPNDDGR NGAIANDPRI
TRVGAFLRAT SIDELPQVLN VLRGDMSMIG PRPHVPNMLV ENTVYGVSVS EYVARHRVRP
GITGWAQVNG MRGGIHDIEK ARRGAQLDIY YIENWTPWLD IKILWRTIFG GLRDPSALRS
PG