Gene Information |
Locus tag | Rcas_0484 |
Symbol | |
ID | 5537947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 624289 |
End bp | 626235 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640892646 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001430632 |
Protein GI | 156740503 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
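The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 624289
END_BP = 626235
REPORTED_GENE_LENGTH = 1947      # bp
REPORTED_PROTEIN_LENGTH = 648    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions, 626235 - 624289 + 1 gives 1947 bp, which is 649 codons, and dropping the stop codon leaves 648 residues, matching the reported values.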
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.339951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTT CGATGCCTGC GTTTCATTTC TATGGGATAG ATGCACTGGC GCTGATTATC GCCGCGTTTC TCGGATATGC TTTGCGCCTC GAACGTCTCG ATCTGAGGGA ATACTGGGGG GCGTATGCGC TGTTTACAGG CATAACCCTG ATCGTTGTCC CTTCGACATT TGCGATGGTT GGCGTGTACG CACAGTACTG GCGCTATGCG TCGTTCTACG AGTTTAGCCT GCTGGCAGTT GCGCTGGCGT TCGCCGGAAT CATGCTGACA GGTCTTGTTC TGCTCATGCG CGCGCTGTTC CCGCACTTGC CGGTTGTGCC ATTGTCGGTT CCGCTGATTT TCACCATGCC CGCGATGGCG CTGACGGCGC TGCCCCGCCT GAGCGTGCAG GCGCGCTTTC GCCCATCGTC GTTGCGTCAT TCACAGCACA GCAGTAATCG GGTGCTGATC ATGGGCGCGG GCGAAGCGGG CGCAATGATT GCGCACAGCC TACGCACTGC GCGACGAAAC ACCATCATCG TTGGCTTCGT CGACGACAAC CCAGGCAAGC GCGGCGTGCG GATCAACGGC GCGTCGGTCC TGGGCAATCG GCACGATATT CCGCGCCTGG TCGCCGATCA CCACGTCGAT GAGGTCATCA TTGCGATGCC GGGCGTTCCC GGAAAAACGA TCCGCGACAT TGTGTCCATC TGCGAGCGGG CCGGCGTGCG CGCGAAAATC ATCCCCGGAC TTGCCGAACT GGTTGATGGA CGCTTTAGTG TCAACCACAT CCGCGATGTG CAGATCGAAG ACCTGCTCCG ACGCGAACCG ATCAGTACCG ATATGCAGGC GGTGAGCGCG CTGATCCGTG GACGGCGCGT TATGGTCACC GGCGGCGGCG GTTCGATTGG TAGCGAAATC TGCCGCCATG TGCTGCGCTA TGAACCGGCT GAACTGATCA TTCTCGGGCA CGGTGAGAAC AGCGTCTTTG CTATTCACAA CGAGCTGCAA CAATGGCTGA ACAACGGCTG CAATGAATCC GGCGTTCCGC GCAGCACTAC TCTTCTGCGC ACGGTCATCG CTGATATTCG CTTTACCGAA CGGATCCATT CCGTATTCGA GCAGTACCGT CCAGAAATTG TGTTCCACGC TGCGGCGCAC AAGCACGTGC CGCTCATGGA AGCCAACCCG GTCGAGGCGG TCACCAATAA TGTGCTCGGA ACGCGCAACC TGCTCGATGC CTCGATTGTG ACCGGCGTCG AACGCTTCGT GATGATTTCG ACCGATAAAG CCGTCAATCC GACCAGCATC ATGGGCAGCA GCAAGCGCGC TGCCGAGTTG CTGGTGCATC ACGCCGCAAA ACGCAGTGGA CGGGCGTTCA TGGCGGTTCG TTTCGGCAAC GTCCTCGGCA GTCGCGGCAG TGTGGTATGG ACGTTCAAAC AACAGATTGC AATCGGCGGA CCGGTGACTG TCACCCATCC TGAAATGCGC CGCTATTTCA TGACCATCCC CGAAGCGGTG CAACTGGTGT TGCAGGCGGC GGCGCTTGGT CAGGGCGGCG AGGTGTTCAC GCTCGACATG GGCGAGCCGG TCAAAATCCT CGATCTGGCG CGCGACATGA TCGAACTCTC CGGGTTGCAG GTCGGGCGCG ACATTGATAT TGCCTTCGTC GGGTTACGCC CAGGCGAAAA ACTCTACGAG GAACTGTTCC TGCCCGGCGA AACGTATGAT CGCACCGGTC ACGAGAAAAT CTTCATCGCC CGCCATGCCG GGCGACTCGT GCCGCCCGAT GTGCTAGCGC TCATCGCCGA TCTCGAAGAA 
GCGGCGCTCG CGGACGACGC GCACCGCACT CAGCGCTTGC TGCGCCTTAT TGTGGAACGC AGCCAGGCGA CATCCATCGA GATGCTGCAC ACTGATCGGG CGCAGGCGCG CCCCGATCTG CGCGCGCTGA CGGCAGGGAG CGTATGA
|
Protein sequence | MARSMPAFHF YGIDALALII AAFLGYALRL ERLDLREYWG AYALFTGITL IVVPSTFAMV GVYAQYWRYA SFYEFSLLAV ALAFAGIMLT GLVLLMRALF PHLPVVPLSV PLIFTMPAMA LTALPRLSVQ ARFRPSSLRH SQHSSNRVLI MGAGEAGAMI AHSLRTARRN TIIVGFVDDN PGKRGVRING ASVLGNRHDI PRLVADHHVD EVIIAMPGVP GKTIRDIVSI CERAGVRAKI IPGLAELVDG RFSVNHIRDV QIEDLLRREP ISTDMQAVSA LIRGRRVMVT GGGGSIGSEI CRHVLRYEPA ELIILGHGEN SVFAIHNELQ QWLNNGCNES GVPRSTTLLR TVIADIRFTE RIHSVFEQYR PEIVFHAAAH KHVPLMEANP VEAVTNNVLG TRNLLDASIV TGVERFVMIS TDKAVNPTSI MGSSKRAAEL LVHHAAKRSG RAFMAVRFGN VLGSRGSVVW TFKQQIAIGG PVTVTHPEMR RYFMTIPEAV QLVLQAAALG QGGEVFTLDM GEPVKILDLA RDMIELSGLQ VGRDIDIAFV GLRPGEKLYE ELFLPGETYD RTGHEKIFIA RHAGRLVPPD VLALIADLEE AALADDAHRT QRLLRLIVER SQATSIEMLH TDRAQARPDL RALTAGSV
|
| |
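The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch of how the protein sequence and GC content could be recomputed from it. It uses the standard codon assignments, which translation table 11 shares with the standard code; the alternative start codons of table 11 are not modeled. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCGCGTT CGATGCCTGC GTTTCATTTC"))  # MARSMPAFHF
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the reported GC content of 60%.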