Gene Information |
Locus tag | Rru_A0014 |
Symbol | |
ID | 3833901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 14699 |
End bp | 15799 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637824083 |
Product | KpsF/GutQ |
Protein accession | YP_425106 |
Protein GI | 83591354 |
COG category | [K] Transcription [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
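The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases.

    Assumes the CDS length counts the stop codon unless has_stop is False.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if has_stop else codons


length = gene_length(14699, 15799)
print(length)                  # 1101, matching "Gene Length"
print(protein_length(length))  # 366, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields.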
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAGC GGAAAACCGG CAGTCCCTTT TCCGCTCGCC GCTCTAGACT TGGCTTCGAG GAAGGCAACG GTTTCGCGAT GACGATCCCC TCTCCCAGCA TTTCCGCTCC GGTGGCCGAC CTCTCCGACG AGGCGGGCCG CGCCCTCGCT TGCGCCCGCC ATGTCCTCGA GGCCGAGGCC GAGGCCCTGC GGGCGCTGGC CGCCGATCTG AACGGCGCCT TCACCGCCGC CATCGACCTG CTGTGCGACG GCCCGGCCAA GCGATCGGGC AAGGTGATCA TTTCGGGCAT GGGCAAAAGC GGCCATGTCG CGGCCAAGAT CGCCGCCACC CTGGCCTCGA CCGGAACGCC GTCGTTCTTC GTCCACCCCG CCGAAGCCAG CCACGGCGAC CTGGGGATGA TCGGGCGCAG CGACGCGGTG ATCGCCCTGT CGAATTCCGG CGAAACCCCC GAACTGGCCG ATATGGTGGC CTATACCCGG CGCATGGGCA TTCCGCTGAT CTCGATCACC GGCCGTCATC CCAGCGCCCT GTCGGACGCC GCCGATGTCG CCCTGGTGCT GCCGGCCTTG ACCGAGGCCT GCCCCCATGG CCTCGCCCCC ACCACCTCGA CCACGGCGAT GATGGCCCTG GGCGACGCCC TGGCCGTGGC CCTGCTCGAG CGTCGCGGCT TCACCGCCAG CGATTTCCGG CTGTTCCACC CCGGCGGCCA GTTGGGGCGC AAGCTGCTCA AGGTCGCCGA CCTGATGCAC GGCCAAGACC GCCTGCCGCT GGTCGGCCCG GCCACGCCGA TGGCCGAGGC CATCCTTGAA ATCAGCTCCA AAAGCCTGGG CTGCGTCGGT GTCGTCGACG CGGCCGGCCG CCTCGCCGGC ATCATCACCG ATGGCGACCT GCGCCGCCAT ATGGGCGCCG ACCTGTGGTC GCGCACCGCC GGCTCGGTGA TGACCCCCAC CCCCAAGACC ATCGCGCCGA CGACCCTGGC GATCGAGGGC CTGCGGATCA TGAACGAAAG CGCCATCACC GGCCTGTTCG CCCTTGACGC CGACAAGCGC CCGGTCGGTT TCCTGCATCT GCATGACTGC CTGAGGGCGG GGCTTGCATG A
|
Protein sequence | MTERKTGSPF SARRSRLGFE EGNGFAMTIP SPSISAPVAD LSDEAGRALA CARHVLEAEA EALRALAADL NGAFTAAIDL LCDGPAKRSG KVIISGMGKS GHVAAKIAAT LASTGTPSFF VHPAEASHGD LGMIGRSDAV IALSNSGETP ELADMVAYTR RMGIPLISIT GRHPSALSDA ADVALVLPAL TEACPHGLAP TTSTTAMMAL GDALAVALLE RRGFTASDFR LFHPGGQLGR KLLKVADLMH GQDRLPLVGP ATPMAEAILE ISSKSLGCVG VVDAAGRLAG IITDGDLRRH MGADLWSRTA GSVMTPTPKT IAPTTLAIEG LRIMNESAIT GLFALDADKR PVGFLHLHDC LRAGLA
|
| |
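The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence as a short excerpt, so its GC value is not expected to match the 69% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as an excerpt only.
excerpt = "ATGACAGAGC GGAAAACCGG"
seq = clean_sequence(excerpt)
print(len(seq))                # 20
print(round(gc_percent(seq)))  # 55
```

Applying the same two functions to the complete gene sequence would allow the reported gene length and GC content to be checked against the sequence itself.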