Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4101 |
Symbol | |
ID | 5086274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 153746 |
End bp | 155914 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640485664 |
Product | hypothetical protein |
Protein accession | YP_001170258 |
Protein GI | 146280101 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.965422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.2163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGGG TGGCGTCAGA TGAAGACGAG ATCGACCTGG GCCGGATCCT GGCCCAGCTC TGGGCGGGAC GCTTCCGGAT CGCGGGCTCC ACAGCGGCAG CGGCCGTCCT GGCGCTCGTT CATCTCGCCG ACACGCCCCC CACCTTTCGG GCCGAGGCGC TCCTGCAGCT CGAAGAGAAG GCGGCACAGG CCCTGCCGGC AGCGCTTTCC GACATTGCCG GTTTGGAGCC GCGCATCGCC GCCGAGATCG AGATCCTGCG CTCGCATTCG GTGCTGGCGG AGGCGGTGGC GGCTCATCGT CTCGATCTCC AGGCCCTGCC GGTCCAGGCG CCTGTCCTGG GCCATGCGGT GGCCTCGGGG CGACTGCCGC TTCCCGATTC CGGAATTCTG GGCGCCTATG ATCGCGGTGA TGGACGGATC CTGCTCGACC TGCTGGAGGT CCCTTCCGAG TGGGTCGATG AATGGATCCG GCTGACGGCG ACGGGGAACG GGGGCTTCAC GCTTTCCCTG CCGGATGGAC GTCGGCTCGA CGGGCAGGTT GGCGAGCCGC TCCTGTGGCC GGAAGCGGGG TTCGGCCTGC GGATCGTCGG GCTGGAGGCA CCCGCAGGGC GACAATTCCG GCTTCGCCGG CAGGACGAGA TCGGGGCCAT CGAGGCATTG CGCGGGCGGC TTTCGGTCAG CGAGCGCGGG CGGGGGGCCC TGATCCTCGA CGTGACCCTG ACGGGCCCGG ATCCTGTCGA GGCCCAAGCT GCGCTGGCCG CGGTCACCGA CGCCTATCTG CGGCAGAACA GGGGCCGGAG CGCGGCAGAG GCGCAGAGCA GTCTGGATTT CATCGAGCAC CAGCTTCCCG GGGCGCGCGA AGCGGTTGCA AGGGTCGAGG ACCGGCTCGA GGCCTATCGT CAGGCCCAGC ACACGCTCGC CCCTGACCTT GAAAGCCTGA GCCTTCTGAA CGAGATCCGT GCGGCCGAGA CGGAGCTGCG CGAGCTGTCG GGGAAAGAGG AGGATCTGGC TCGTCGTTTC ACACCCCTGC ATCCGGCCTA CCAGAAGCTC CTCGCCGCAC GGGCGCGGGC GGAGGACCGG CTGGCCGGGT TGCGCCAGGA GGCGGCCGGT CTTCCGGAAA CCCAACGGGG TCTCTTCAAC CTTTCACGTG AACTGGACGT TGCTCGCCAG GTTCATCTGG ACCTGCTGAA CCGGGCGCAG GAACTGCGCG TCCTCACGGC GAGCACGCTC GGCAATGCCC GCCTCCTCGA TGCGGCCCGG GCAAGACCCG CTCCCGTGGC CCCGCGACGG GGACGGGTGC TGGCACTTGC CCTTCTTCTC GGCGCGCTTG GCGGCGCAGG GTATGTGCTC GGGCGCAACT GGCTGCAGGC GGTCATCCTC GGCCCCGAGG ATCTCGACCG GCTGGGATTG CCGGTCTATG CCACCGTGCT TCTTGCGCCG CAGGCGGTGC GTCAGCGAGG AGACCGCCGT CCCTGGCCGA TCCTGGCCCT GACCGATCCC GATTGCGTCA CCCTGGAGGG GATCCGGTTG CTGCGGGCGG GGCTGCACTT CGGGCGTGAG GCAGCACGCA GCCGCTCGGT CGGTTTCACC TCGCCATCCT CCGGAGCGGG CAAGTCCTTC CTCGCAGCCA ACCTTGCGGT GGTTGAGGCA CAGGCGGGTC AGCGGGTCTG CCTCGTCGAC ACCGACCTGC GGCGGGGGGA TCTCCGGCGC TACTTCGGGA TGTCGAGGGG AACGCCGGGG CTGTCGGACT ATCTCTCTGG CTCGGCGGCC GTCGACGATC TTCTTCGCCC GGGGCCGGTG GAGGATCTGA TGGTGCTGAC CGCAGGCCGA CTTCCACCAC ACCCTTCCGA GCTTCTCCTG CGGCCGGCCT TCGCCCACCT CGTCGCGGAA CTCGACCGCC GCTTTGACCT TGTGATCTTC GACATGCCCC CGGCGCTCGC CGTCACCGAC GCGGCCGTGA TCGGTCGCAC GATGGGGATC ATGCTGGCGG TCCTGCGCCA CGCCATCACC GAGCGCGAGG AGGTGGAGGT CATGATCCGC CAGATGCAGG GCGCCGGCGT GACACTCGGG GGCGCGGTGA TCAACGGCTA TCGGCCCTGC GGCCGCCGAG GCGCCTACGG CTATCGCTAC GACGACAGCT ACAGCTATCG TTCCGGACAG GAGGCCTGA
|
Protein sequence | MARVASDEDE IDLGRILAQL WAGRFRIAGS TAAAAVLALV HLADTPPTFR AEALLQLEEK AAQALPAALS DIAGLEPRIA AEIEILRSHS VLAEAVAAHR LDLQALPVQA PVLGHAVASG RLPLPDSGIL GAYDRGDGRI LLDLLEVPSE WVDEWIRLTA TGNGGFTLSL PDGRRLDGQV GEPLLWPEAG FGLRIVGLEA PAGRQFRLRR QDEIGAIEAL RGRLSVSERG RGALILDVTL TGPDPVEAQA ALAAVTDAYL RQNRGRSAAE AQSSLDFIEH QLPGAREAVA RVEDRLEAYR QAQHTLAPDL ESLSLLNEIR AAETELRELS GKEEDLARRF TPLHPAYQKL LAARARAEDR LAGLRQEAAG LPETQRGLFN LSRELDVARQ VHLDLLNRAQ ELRVLTASTL GNARLLDAAR ARPAPVAPRR GRVLALALLL GALGGAGYVL GRNWLQAVIL GPEDLDRLGL PVYATVLLAP QAVRQRGDRR PWPILALTDP DCVTLEGIRL LRAGLHFGRE AARSRSVGFT SPSSGAGKSF LAANLAVVEA QAGQRVCLVD TDLRRGDLRR YFGMSRGTPG LSDYLSGSAA VDDLLRPGPV EDLMVLTAGR LPPHPSELLL RPAFAHLVAE LDRRFDLVIF DMPPALAVTD AAVIGRTMGI MLAVLRHAIT EREEVEVMIR QMQGAGVTLG GAVINGYRPC GRRGAYGYRY DDSYSYRSGQ EA
|
| |