Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4167 |
Symbol | |
ID | 5211151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5218431 |
End bp | 5220377 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640597756 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001278461 |
Protein GI | 148658256 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCAGC GACTTCAAAA CCGGCATTTT CTCCTTTTCG ATATCCTGCT CGTGCCGCTT GCGATCTACC TGAGTTTCGT CCTGCGGCTG GAAATGTTCA ATCTCGGCAG TTACTGGCTG GTATGCATGC AGTTCTGCCT GACGGCGGTC GTGACCACCC CCCTGGTGTT TCGCGCGCTG GGGATCTACC GTCGCTACTG GCGCTACGCT TCGTTTGAAG AACTGCTGCT GCTCTGTAGT GCAACGTCGA TTGCGCTGGC GCTTGCCACG CTTGTGTTTA CTCTGATCGA TGCGCTCTTG CCGGTGGTCG CAACGATGCC GCGCTCCATT CCTTTTATCG TTCCGCCCAT CGCAGCCACG CTTATCAGTG TGCCACGGTT GCTGGTGCGC ATCGGTGCAG CGCGCGAGCG TCGGCGTCGT GCAACTGACC GACCGGCGCC GGTGTTGATC ATGGGCGCTG GCGATGCTGC GTCGATTATT GTGCGTGAGA TTCAACGCAA TCCAAAACTC GGCATGGAGG TTGTCGGGCT GCTGGACGAC GATCCGGCGA AGCGTGGGCT GCGGTTGCAC GGCGTCGAAG TGATGGGTGA CCGCCACGCT ATTCCGACAC TGGTAGCCCG CCACAAGGTG CGTCAGGTAA TCATTGCGAT GCCAGGCGCG CCTGGTAAGG CAGTACGCGA GATTATGCAT ATTTGCGAGT CTGTTGGTGT GACAGTGCGC ATCATGCCCG GGGTTCACGA ACTGATCGAC GGAACGATCA GCGTCAGCAA ACTGCGCAAC ATCCAGATTG AGGACCTGCT GCGCCGTGCG CCGGTGCAAA CCGATACCGC GGCAGTGCGC GCGCTGATCG CCAACCGACG GGTCCTGGTA ACCGGCGGCG GCGGTTCCAT CGGCAGCGAA CTGTGCCGCC AGTTGATCCG CTGCGGTCCA TCGCACCTGA TTGTGCTTGG TCACGGCGAA AACAGTGTGT TCGAGATCTG CAACGAACTT CAGCGTCTGG CAGAAGCGCA CGCCGGTCAA TCGCCGCACA TTGTGCCGGT GATCGCCGAT ATTCGTGATC TGGAACGCCT GCGCGCGGTG TTCGAAATGC ATGCGCCGGA ACTCGTTTTT CACGCAGCCG CACACAAACA TGTTCCACTG ATGGAGGAAC ATCCGGTCGA AGCCATCAGC AACAATGTCA TCGGCACGCG CAACCTGCTC GACGTATCGC TCGAAACCGG CGTCGAACGG TTTGTGATGA TCTCATCGGA TAAGGCGGTC AATCCGACGA GCGTGATGGG CGCAACCAAG CGCATTGCCG AGATGCTGGT GCTCAACGCT GCGCGGATCA GCGGACGACC CTACGTGGCG GTGCGTTTTG GGAATGTGCT GGGCAGTCGT GGCAGTGTCG TGCTGACCTT CAAACGGCAG ATTGCCGCCG GTGGACCGGT AACGGTCACG CATCCGGAGA TGCGTCGCTA CTTCATGACC ATTCCAGAAG CGGTGCAACT GGTGCTCCAG GCGTCGGTAC TGGGGCGCGC CGGCGAGATT TTTATGCTGG ACATGGGGGA ACCGGTGAAG GTGGTCGATC TGGCGCGCGA CATGATCCGT CTGTCGGGAT TGGAGGTCGG GCGTGATATT GATATCTGCT TCACCGGCAT ACGTCCGGGT GAGAAATTAT TTGAAGAATT GTTCGCCCAC GGTGAAGAAT ATCAGCCAAC AGCGCACAGC AAAATCTTCA TCGCCGCTGG CGCCAGCAAC AATATTCCGC CCGACTTGCG CACGGATGTA GCGCTGCTCG AACAGGTTGC GCGCGCGAAC GACGATGCCG CCGCACGACG CATGCTGCGC CACATCGTCC CGGAGTACTG CCCGCCGTTG CCTGCCCCGC CGATACCTGT CGCTGAAAAT ACGCCCTATC CTGTGCTGGT GCGTCCATTG CAACCGCTGA TCGGGGGTGG ACGATGA
|
Protein sequence | MMQRLQNRHF LLFDILLVPL AIYLSFVLRL EMFNLGSYWL VCMQFCLTAV VTTPLVFRAL GIYRRYWRYA SFEELLLLCS ATSIALALAT LVFTLIDALL PVVATMPRSI PFIVPPIAAT LISVPRLLVR IGAARERRRR ATDRPAPVLI MGAGDAASII VREIQRNPKL GMEVVGLLDD DPAKRGLRLH GVEVMGDRHA IPTLVARHKV RQVIIAMPGA PGKAVREIMH ICESVGVTVR IMPGVHELID GTISVSKLRN IQIEDLLRRA PVQTDTAAVR ALIANRRVLV TGGGGSIGSE LCRQLIRCGP SHLIVLGHGE NSVFEICNEL QRLAEAHAGQ SPHIVPVIAD IRDLERLRAV FEMHAPELVF HAAAHKHVPL MEEHPVEAIS NNVIGTRNLL DVSLETGVER FVMISSDKAV NPTSVMGATK RIAEMLVLNA ARISGRPYVA VRFGNVLGSR GSVVLTFKRQ IAAGGPVTVT HPEMRRYFMT IPEAVQLVLQ ASVLGRAGEI FMLDMGEPVK VVDLARDMIR LSGLEVGRDI DICFTGIRPG EKLFEELFAH GEEYQPTAHS KIFIAAGASN NIPPDLRTDV ALLEQVARAN DDAAARRMLR HIVPEYCPPL PAPPIPVAEN TPYPVLVRPL QPLIGGGR
|
| |