Gene Rru_A3539 details


Gene Information

Locus tag: Rru_A3539
Symbol:
ID: 3836994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 4070193
End bp: 4073003
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 65%
IMG OID: 637827662
Product: PII uridylyl-transferase
Protein accession: YP_428620
Protein GI: 83594868
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2844] UTP:GlnB (protein PII) uridylyltransferase
TIGRFAM ID: [TIGR01693] [Protein-PII] uridylyltransferase
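
The length fields above follow from the coordinates by simple arithmetic. The sketch below is only an illustrative consistency check, not part of the original record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive coordinates (assumption: 1-based, inclusive)."""
    return abs(end_bp - start_bp) + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4070193, 4073003)
print(length, "bp")                    # gene length
print(protein_length_aa(length), "aa")  # protein length
```

With the Start bp and End bp values from this record, the result agrees with the listed Gene Length (2811 bp) and Protein Length (936 aa).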


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATAC CCCGCATCCG GCAGCCCCGG GCCGTTATCG ACCGCAAGGC GCTGACCGTC 
GTTCTCGAGG ATCTGGCGGC GACGGTGACC GATAACCGCG AGCGCCGCGC CCGGTTGCTG
GCGGTGCTCA AGGGCGCCCT GGGCGATGGG CGGGCCGAGG TGCGACGGCG TTTCCTCGAG
GAAAAGGGCA CGGGCGCGGC GGTCTTCGCC GAAAACAGCC ATCTGATGGA CCAGATCATC
CGGCTGCTGT TTGATTTCAC CACCACCCAT GTCTATCCCC GGGCCAACCG CACTATCGGC
GAGCAGATGA CCGTGCTCGC CGTCGGCGGC TATGGGCGCG GCGAGATGTC GCCGCAGTCC
GATGTCGATC TGCTGTTCCT GCTGCCCTAC AAGGCGACGC CGCTGCATGA ACAGGTCGTG
GAATACATGC TCTATACCCT ATGGGACATG GGGCTGAAGG TCGGTCATGC CACGCGCTCG
ATCGAGGAAT GCATCCGTCA GGCGCGCGGC GATCTGACCA TCCGCACCGC CATGCTCGAA
ACCCGCTATC TGTGGGGCGA CCGGGCGTTG TACGGCCAGC TTAAAACCAA ATTCTGGACC
GGCGTCGTCA CCGGCACCGG CCCCGATTTC GTCGAGGCCA AGCTGGCCGA GCGCGACGAG
CGCCACCTGC GCATGGGCGA CAGCCGTTAT GTGCTGGAAC CCAACATCAA GGACGGCAAG
GGGGGCTTGC GCGATCTTCA TACCTTGCTG TGGATCGCCC GCTATATCTA CGGCGTCTCC
GACATGCGCG AACTGGTGGA ACTGGGCGTG CTCAGCGCCG ACGCTGCGAC CAAATTCGGC
CGGGCCCGGG CCTTCTTGTG GACGGTGCGC TGCCATCTGC ACTATCTGGC CGACCGCCCG
GAAGAGCGTC TGACCTTCGA TGTCCAGCCG GCGATCGCTG CGCGCATGGG CTATACCGAC
CGCAACAGCG GCCGCGGCGT CGAACGCTTC ATGAAGCATT ATTTCCTGAT GGCCAAGACC
GTGGGCGATT TGACCCGCAT CTTCTGCGCC GTGCTCGAAG ACCAGCAAAA GCGCCGGCCG
ATCCTGTCGA TCGCCACCTT GCTGATGCGC AAGCGCAATC TGGGGGATTT CGTGCTTGAT
GGCGGGCGGC TGGCGGTGGC CGGGCGCGGC GCCTTTCGCG AGCATCCCTT GCAATTGATC
AGCCTGTTCA AGGTCGCCCA TGACCACGGC CTGGATATCC ACCCCGATAC CCTGCGTCTG
GTCACCGAGC ATCTGCCCAC CGTGACGCTG CTGCGCAACG ACGCGAAGGC CAATGCCCTG
TTCATGGAGA TCCTGACCTC GCGCAAGGAC CCGGAACTGG CGCTGCGCCA GCTGTCGGAA
TCGGGGGTGC TCGCCCGCTT CATTCCCGAT TTCGGGCGGG TCACCGCCCA GATGCAGTTC
GACATGTACC ATGTCTATAC CACCGACGAG CATACCATCC GCGCCATCGG CCTGCTCCAT
CGCCTGGAGA CCGGGGCGCT GCGCGACCGC ATGCCCGCCG CCGCCGATGC CGTTCACAAG
GTGCAGTCGC GCCGGGCGCT GTATCTGGCG GTGCTGCTCC ACGATATCGC CAAGGGCCGG
GGCGGCGATC ACTCGATTTT GGGCGCCGAG GTGGCGATGC GCCTGGGTCC GCGCCTGGGG
ATGAGCGAAG AGGAAACCGA AACCGTCGCT TGGCTGGTGC GCCATCACCT TGATATGTCG
CGCACCGCCT TCAAGCGCGA CCTTGACGAC ATCAAGACCA TCCTTGATTT CACCGGCTTG
GTGCAATCGG TCGAACGCCT GCATCTGCTG CTCGCCCTGA CCACGGTCGA TATCCTGGCC
GTTGGTCCGG CGGTCTGGAA CAACTGGAAA TCCTCGCTGC TGCGCGAGTT GTACACCCAC
TCCAAGGACG TGCTGACCAG CGGCTTCCAG GCCGAGGCCC GCGACAAGCG CGTCGCCCAT
AAGCGCGAGG AGCTGGCCGC GGCGCTGGCC GATTGGCCGC AGGCCTCGCG CGAGCGCTAC
CTTGATCTGC ATTATCCGGC CTATTGGCTG ACCTTCGACA GCGCCACCCA TCTGCGTCAC
GCCCGCATGC TGCGCCGCGC CCGCGACGCC GGCTTGACCG TGGCCGTCGA GGTGCTGCCC
GATCCCGAGC GCGCGGTTTC CGAGGTGCTG GTCGCCACCG ACGACCATCC GGGGCTGTTT
TCCAAGATCG CCGGGGCGAT GGCCCTGGCC GGGGTCAATA TTCTAGATGC CAAGATCACC
ACCATGTCCG ATGGCGGGGC GCTTGATATC TTCACCGTCC AAACCCTTGA AGGGCACGCC
ATCGAAAAGG AAGAGCGCAT CGCCCGGCTG GCCAAAACCG TGCGCGATGT ACTGACGGGC
GATCTGCCCT TGGAAAAGGC CCTGCGCCGC CAGCCGCCGC GCCTGCCCGA ACGCACCCGC
CACCTGACCG TGCCGCCGCG CGTCATCGTC GACAATCAGG CCTCCAAGAC CCATACGGTG
ATCGAGATCA ACGGCCGCGA CCGCCCCGGA TTCCTCTACG CGGTCACCCG GGCGCTGACC
GATGTGGCGG TGCAGATCTC CTCGGCCCGG GTCTCGACCT ATGGCGAGCG CGTGGTCGAC
AGCTTCTATG TCAAGGATGT GTTCGGCATG AAGATCGTCC ACCGCGCCAA GCTGGCCCAG
ATCCGCGAGG CTTTGGAGGC GGCGATCACC CAGACCGTGC CGCGCAAAGT CGAAGAGGGG
GCCGAGCAGG GGGCCGAAAA GGCCGACGCC GGGGAGATCG TCGCCGCCTG A
 
Protein sequence
MTIPRIRQPR AVIDRKALTV VLEDLAATVT DNRERRARLL AVLKGALGDG RAEVRRRFLE 
EKGTGAAVFA ENSHLMDQII RLLFDFTTTH VYPRANRTIG EQMTVLAVGG YGRGEMSPQS
DVDLLFLLPY KATPLHEQVV EYMLYTLWDM GLKVGHATRS IEECIRQARG DLTIRTAMLE
TRYLWGDRAL YGQLKTKFWT GVVTGTGPDF VEAKLAERDE RHLRMGDSRY VLEPNIKDGK
GGLRDLHTLL WIARYIYGVS DMRELVELGV LSADAATKFG RARAFLWTVR CHLHYLADRP
EERLTFDVQP AIAARMGYTD RNSGRGVERF MKHYFLMAKT VGDLTRIFCA VLEDQQKRRP
ILSIATLLMR KRNLGDFVLD GGRLAVAGRG AFREHPLQLI SLFKVAHDHG LDIHPDTLRL
VTEHLPTVTL LRNDAKANAL FMEILTSRKD PELALRQLSE SGVLARFIPD FGRVTAQMQF
DMYHVYTTDE HTIRAIGLLH RLETGALRDR MPAAADAVHK VQSRRALYLA VLLHDIAKGR
GGDHSILGAE VAMRLGPRLG MSEEETETVA WLVRHHLDMS RTAFKRDLDD IKTILDFTGL
VQSVERLHLL LALTTVDILA VGPAVWNNWK SSLLRELYTH SKDVLTSGFQ AEARDKRVAH
KREELAAALA DWPQASRERY LDLHYPAYWL TFDSATHLRH ARMLRRARDA GLTVAVEVLP
DPERAVSEVL VATDDHPGLF SKIAGAMALA GVNILDAKIT TMSDGGALDI FTVQTLEGHA
IEKEERIARL AKTVRDVLTG DLPLEKALRR QPPRLPERTR HLTVPPRVIV DNQASKTHTV
IEINGRDRPG FLYAVTRALT DVAVQISSAR VSTYGERVVD SFYVKDVFGM KIVHRAKLAQ
IREALEAAIT QTVPRKVEEG AEQGAEKADA GEIVAA
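
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record: the names `CODON_TABLE` and `translate` are hypothetical. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code, and it ignores the table's alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same amino acid assignments as the standard code; it differs only
# in which codons may act as start codons (not handled in this sketch).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping and line breaks used on this page)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACCATAC CCCGCATCCG GCAGCCCCGG"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces and line breaks) gives a quick integrity check on both blocks.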