Gene Rru_A0014 details

Gene Information

Locus tag: Rru_A0014
Symbol:
ID: 3833901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 14699
End bp: 15799
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 69%
IMG OID: 637824083
Product: KpsF/GutQ
Protein accession: YP_425106
Protein GI: 83591354
COG category: [K] Transcription; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0794] Predicted sugar phosphate isomerase involved in capsule formation; [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains
TIGRFAM ID: [TIGR00393] KpsF/GutQ family protein
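
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 14699
END_BP = 15799
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1101
print(expected_protein_length(GENE_LENGTH_BP))  # 366
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.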


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAGC GGAAAACCGG CAGTCCCTTT TCCGCTCGCC GCTCTAGACT TGGCTTCGAG 
GAAGGCAACG GTTTCGCGAT GACGATCCCC TCTCCCAGCA TTTCCGCTCC GGTGGCCGAC
CTCTCCGACG AGGCGGGCCG CGCCCTCGCT TGCGCCCGCC ATGTCCTCGA GGCCGAGGCC
GAGGCCCTGC GGGCGCTGGC CGCCGATCTG AACGGCGCCT TCACCGCCGC CATCGACCTG
CTGTGCGACG GCCCGGCCAA GCGATCGGGC AAGGTGATCA TTTCGGGCAT GGGCAAAAGC
GGCCATGTCG CGGCCAAGAT CGCCGCCACC CTGGCCTCGA CCGGAACGCC GTCGTTCTTC
GTCCACCCCG CCGAAGCCAG CCACGGCGAC CTGGGGATGA TCGGGCGCAG CGACGCGGTG
ATCGCCCTGT CGAATTCCGG CGAAACCCCC GAACTGGCCG ATATGGTGGC CTATACCCGG
CGCATGGGCA TTCCGCTGAT CTCGATCACC GGCCGTCATC CCAGCGCCCT GTCGGACGCC
GCCGATGTCG CCCTGGTGCT GCCGGCCTTG ACCGAGGCCT GCCCCCATGG CCTCGCCCCC
ACCACCTCGA CCACGGCGAT GATGGCCCTG GGCGACGCCC TGGCCGTGGC CCTGCTCGAG
CGTCGCGGCT TCACCGCCAG CGATTTCCGG CTGTTCCACC CCGGCGGCCA GTTGGGGCGC
AAGCTGCTCA AGGTCGCCGA CCTGATGCAC GGCCAAGACC GCCTGCCGCT GGTCGGCCCG
GCCACGCCGA TGGCCGAGGC CATCCTTGAA ATCAGCTCCA AAAGCCTGGG CTGCGTCGGT
GTCGTCGACG CGGCCGGCCG CCTCGCCGGC ATCATCACCG ATGGCGACCT GCGCCGCCAT
ATGGGCGCCG ACCTGTGGTC GCGCACCGCC GGCTCGGTGA TGACCCCCAC CCCCAAGACC
ATCGCGCCGA CGACCCTGGC GATCGAGGGC CTGCGGATCA TGAACGAAAG CGCCATCACC
GGCCTGTTCG CCCTTGACGC CGACAAGCGC CCGGTCGGTT TCCTGCATCT GCATGACTGC
CTGAGGGCGG GGCTTGCATG A
 
Protein sequence
MTERKTGSPF SARRSRLGFE EGNGFAMTIP SPSISAPVAD LSDEAGRALA CARHVLEAEA 
EALRALAADL NGAFTAAIDL LCDGPAKRSG KVIISGMGKS GHVAAKIAAT LASTGTPSFF
VHPAEASHGD LGMIGRSDAV IALSNSGETP ELADMVAYTR RMGIPLISIT GRHPSALSDA
ADVALVLPAL TEACPHGLAP TTSTTAMMAL GDALAVALLE RRGFTASDFR LFHPGGQLGR
KLLKVADLMH GQDRLPLVGP ATPMAEAILE ISSKSLGCVG VVDAAGRLAG IITDGDLRRH
MGADLWSRTA GSVMTPTPKT IAPTTLAIEG LRIMNESAIT GLFALDADKR PVGFLHLHDC
LRAGLA
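
The sequences above are displayed in blocks of ten characters, so the spacing has to be stripped before any analysis. The sketch below is illustrative only and not part of the original record; `clean` and `gc_fraction` are hypothetical helper names, and only the first and last displayed lines of the gene sequence are used as sample input.

```python
def clean(seq_text):
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGACAGAGC GGAAAACCGG CAGTCCCTTT TCCGCTCGCC GCTCTAGACT TGGCTTCGAG"
last_line = "CTGAGGGCGG GGCTTGCATG A"

print(len(clean(first_line)))   # 60
print(clean(first_line)[:3])    # ATG
print(clean(last_line)[-3:])    # TGA
print(gc_fraction(first_line))  # GC fraction of this one line only
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field; the record does not say how that percentage was rounded.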