Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3654 |
Symbol | |
ID | 5210632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4570046 |
End bp | 4572016 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640597247 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001277959 |
Protein GI | 148657754 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.772925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGAT CATTGCCTGC CGGACATCTC TGCACAATCG ATGTGTTGAT GTTGACTATT GCGGCGTTCG CCAGTTATGC ACTACGTCTC GAGCGTCTCG ATCTGGGTGA GCACTGGCGG TCGTTCGTGC TGTTTGCCGG AGCAGCGCCG GTTGTTGTGC TGATGCTCTT TGGGATGACA CGGGTCTATG CCCAGTACTG GCGCTATGCA TCGTTCCACG AGTTCAGTCT GCTGGCGTGG GCGCTTGTGT GCGCCGGGAT CGTTCTGGAA GGCATGGTGC TGATCGGGCG CGCCCTGTTT CCAGCGATAC CGGTTGTCCC GTTGTCGATC CCGCCGATCT TCGTCTTATC CGCGCTGACA CTGACCGCAT TGCCGCGCCT GATGATGCAC GCGCGATTTC AACCTTCCCC CCGGCGCTGG CATCGACAGA GCGGTAATCG TGCGCTGATC ATGGGCGCCG GTGAAGCCGG CGCAATGATT GTGCAGAGTA TGCGTCTTGC CCGGCAGAAT AATGTTATTG TGGGCTTTGT CGATGATAAC CCGCACAAGC GAGGCGTGCG CATCAATGGC GCGCCGGTGC TCGGTGATCG TCACGATATT CCGCGTCTGG CGGCAGAGTA TCAGATCAAC GAGGTGATCA TCGCCATGCC GAGTGCGCCG GGCAAAACCA TCCGTGAGAT TGTTGCGATC TGTGAACGTG CCGGTGTGCG CGCTCGCATC ATCCCCGGAA TAGCCGAACT GGTCGATGGT CGGTTCAGCG TCAATCATAT CCGCGATGTG CAGATCGAAG ACCTGCTCCG CCGCGCGCCG ATACAAACCG ATATGCAGGC GGTAGGGCGC CTGATCCGTG GGCGGCGCGT GCTGGTCACC GGCGGGGGCG GATCGATCGG GAGCGAAATC TGTCGCCACG TGCTGCGGTA CGAACCGTCT GACCTGATCA TTCTGGGGCA CGGTGAAAAC AGTGTATTCG CCATCCACAA CGAGTTGTAC CGGTGGTTGA ACACGCCACG CGGAGAGTCG GATAGCGTAG ACGGTGATGG ACAATGCCGG TCATACCGCA CGCCAACGCT GCATACGGTG ATTGCCGATA TTCGCTTCTC CGAGCGCATT CACGCGGTGT TCGAGCGGTA TCGTCCGGAG ATCGTGTTCC ATGCCGCAGC GCACAAGCAC GTTCCGCTGA TGGAAGCCAA CCCCGTCGAA GCGGTGACCA ACAATGTGCT TGGCACGCGC AATCTGCTCG ATGCCGCAAT TGTCACCGGC GTTGAACGCT TCGTCATGAT CTCGACCGAT AAGGCGGTCA ACCCCACCAG CATCATGGGC AGCAGCAAGC GTGCCGCCGA ACTGCTGGTG CATCACGCAG CGAAGGTCAG CGGTCGGGCG TTCATGGCAG TGCGTTTCGG CAACGTCCTG GGCAGTCGCG GCAGTGTTGT GTGGACGTTC AAGCAGCAGA TTGCCGCCGG CGGACCGGTG ACAGTAACCC ATCCAGAGAT GCGCCGTTAT TTCATGACCA TCCCCGAAGC GGTGCAACTG GTGTTGCAGG CGGCGGCGCT TGGTCGGGGC GGCGAGGTGT TTACGCTGGA CATGGGTGAG CCGGTCAAGA TTCTCGATCT GGCGCGCGAT ATGATCGAAC TCTCCGGGTT GCAGGTAGGG CGCGACATCG ATATCGCCTT TGTAGGGTTG CGCCCAGGCG AGAAACTCTT TGAGGAACTG TTCCTGCCCG GCGAGCAGTA CGACCGCACA AGCCACGAGA AGATTTTCAT TGCCAGGAAT GCCAGCCGGC TTGTCCCCGC CGATGTGCTC GCGCTGATCG CCGATCTTGA AGAGGCGGCT CTGTCGGACG ATACGTCACG CACCGTCCGG TTGCTCCGTC TCATCGTTCA GCGCAGTCAA TCGACGCCAC ACGAGGATGT GCACGGCGAT CATACGCTCG AGCCTGCCAG CCTGCGCGCG CTCGCAGTGG GTGGGTCGTA G
|
Protein sequence | MSRSLPAGHL CTIDVLMLTI AAFASYALRL ERLDLGEHWR SFVLFAGAAP VVVLMLFGMT RVYAQYWRYA SFHEFSLLAW ALVCAGIVLE GMVLIGRALF PAIPVVPLSI PPIFVLSALT LTALPRLMMH ARFQPSPRRW HRQSGNRALI MGAGEAGAMI VQSMRLARQN NVIVGFVDDN PHKRGVRING APVLGDRHDI PRLAAEYQIN EVIIAMPSAP GKTIREIVAI CERAGVRARI IPGIAELVDG RFSVNHIRDV QIEDLLRRAP IQTDMQAVGR LIRGRRVLVT GGGGSIGSEI CRHVLRYEPS DLIILGHGEN SVFAIHNELY RWLNTPRGES DSVDGDGQCR SYRTPTLHTV IADIRFSERI HAVFERYRPE IVFHAAAHKH VPLMEANPVE AVTNNVLGTR NLLDAAIVTG VERFVMISTD KAVNPTSIMG SSKRAAELLV HHAAKVSGRA FMAVRFGNVL GSRGSVVWTF KQQIAAGGPV TVTHPEMRRY FMTIPEAVQL VLQAAALGRG GEVFTLDMGE PVKILDLARD MIELSGLQVG RDIDIAFVGL RPGEKLFEEL FLPGEQYDRT SHEKIFIARN ASRLVPADVL ALIADLEEAA LSDDTSRTVR LLRLIVQRSQ STPHEDVHGD HTLEPASLRA LAVGGS
|
| |