Gene Rru_A3118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3118 
Symbol 
ID3836564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3595958 
End bp3597556 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content64% 
IMG OID637827233 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_428200 
Protein GI83594448 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0491036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGCGA TTTTCGAGGC CCTTCTCTCC CATGTTCCCG CCATGTGGCG CCGCCGTTGG 
ATCGTCATGG TGGTGGCCTG GGTGGCGTGC CTTGTCGGCT GGGGCGCCGT CGCCCTCATC
CCCAATGTCT ATTCGACGGC GACGACGGTT TATGTTGAAA CCGAGTCCCT GTTGCGCCCG
CTGCTCCAGG GCATGACCAT CGATGTCGGC ACCGACAAGC TCCGCATCAT GAACCAGACC
CTGACCAGCC GGCCCAACCT GGAAAAGGTG GTGCGGGTGG TCGGCCTCGG CGGTTCCTCG
CCCGCCGATT TCAACGCGGC GGTCACCAAG GTGCAAAAGG ATATTTCGGT CCGCATGTCG
GAACGCAATA ACACCTTCAC TATTTCCTAT GGCGCCCGCG ATCCCGAAGT CGCCTTCCGC
GTCGTCGACA CCTTGTTGAC CATCTTCGTT GAAAGCAATC TTGGCTTCGC CCGCCAGGAA
CTTGACGTCG CCCGCGAGTT CATCGGCAAG CAGGTCAAGG AATACGAAGA GCGGGTCGCC
GAGGCCGAGC GCAAACTTGC CGACTTCAAG CGCGAGAACA CCGAGTTCCT GCCCGGGGCC
TCGACCTACC GCGAGTATCT CGACGGCCTG CGCCGCGAGA CGGCGGCGAT CGCCGCCGAA
ATCGCCGATA CCAAGGCCAA GGTCGCCTCG ATGACCACCC AGCTTGGCGA TCTCAAGCCG
ATGGTTCCCT CAGGGGGGGC GGCGGCGATG TCGCCGCTGC AGATGCGGGT CGAGCAGATG
CGAGCCTCGC TCGACGATCT TAAGTTGCGC TATACGGAAA AGCATCCGGC GATGGAGGAG
ACGCGCCGGC AACTGGCGAT CCTTGAAGAT CGCCTGCGCA GCGAGGGCGG ATCGTCCGCC
GGCGGGATGG GGGCCATTCC CAATCCGGTC TATGAGAACG TCCGCATGGA AAAGGTCGCC
CTGACGGCGC AGATCGCCGC CTTGGAAAAC CGTTTCACCC GCAAGACCGA GGAAGTGGCC
CAACTGGAGG CGAAGGCGCC CCGGGTTCCC GAGGTCGAGG CCGAGTTTAC CCGACTGACC
CGTGACGCCG AGCTGGTCCG CCGCCAGTTC GAGGAGCTCC AGGCCCGCCA GGAGGCGGCC
AAGCTGTCGC AAAGTCGGGA AATGGAAGCC GAAAAGGTCC AGTTCCGGGT GATCAACCCG
CCGCAGAAGG CCCAGCATCC CTCGGGCATC AAGCGGTCCT TGCTGATTAC GGCGGTGCTG
TTTGGCGGCC TTGGCCTGGG GCTGGCCCTG ACCTTCCTGA TCAGCGCCAT GCAATCGACC
TTTTATAGTC CCTATCGCAT CATCCATGCG CTGGGCGTGC CGGTGATCGG AACGATCTCG
TTCATCGCCC CCCGGCGAAC CCTTGGCTCC AAGCTGGCCC TCGGCTCGTT CAGCGCCAGC
GCCTTGCTGC TGGTCGTCGC CTGGATGGCC CTGATGCTGG TGGAATACCA ATGGGGACTG
CCCACCTTGC TGCCGCAGCG GCTGCAAGAC CAGTTCTCCT TCCCGATGAC CGCGTCGCTC
GCCAGTGGCC ATTCCCTCCC GGCACAGCCG GCGGTTTGA
 
Protein sequence
MNAIFEALLS HVPAMWRRRW IVMVVAWVAC LVGWGAVALI PNVYSTATTV YVETESLLRP 
LLQGMTIDVG TDKLRIMNQT LTSRPNLEKV VRVVGLGGSS PADFNAAVTK VQKDISVRMS
ERNNTFTISY GARDPEVAFR VVDTLLTIFV ESNLGFARQE LDVAREFIGK QVKEYEERVA
EAERKLADFK RENTEFLPGA STYREYLDGL RRETAAIAAE IADTKAKVAS MTTQLGDLKP
MVPSGGAAAM SPLQMRVEQM RASLDDLKLR YTEKHPAMEE TRRQLAILED RLRSEGGSSA
GGMGAIPNPV YENVRMEKVA LTAQIAALEN RFTRKTEEVA QLEAKAPRVP EVEAEFTRLT
RDAELVRRQF EELQARQEAA KLSQSREMEA EKVQFRVINP PQKAQHPSGI KRSLLITAVL
FGGLGLGLAL TFLISAMQST FYSPYRIIHA LGVPVIGTIS FIAPRRTLGS KLALGSFSAS
ALLLVVAWMA LMLVEYQWGL PTLLPQRLQD QFSFPMTASL ASGHSLPAQP AV