Gene Information |
Locus tag | Rcas_2745 |
Symbol | |
ID | 5540231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3549631 |
End bp | 3553434 |
Gene Length | 3804 bp |
Protein Length | 1267 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640894871 |
Product | glycosyl transferase family protein |
Protein accession | YP_001432834 |
Protein GI | 156742705 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase; [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
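The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3549631
end_bp = 3553434

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values match the Gene Length (3804 bp) and Protein Length (1267 aa) rows.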
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.636531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGCG CCGTAATCGT ACTGACATGG AACGGCGGCG CCGAGGCAAT CGCCTGCCTG CAACGGGTGC GCCAGCTCAA CCCGGCGCCC GACATGGTTC TGGTGGTGGA CAACGACTCG CGCGACGGCA CGCCGGAGCA GATTGCTGCC CTTTTTCCCG ATATTACGCT GATCCGAAAT GCGCAGAACC TGGGATATGC CGGTGGGATG AACATCGGCA TCCGCGCATT GCTGGCGCAT GAATCGCCGC CGGACATCAT CGTTCTGCTC AATCAGGACA CGCTGGTTGA TCGAGAATGG CTCGGCGCGA TCACCGCTCC GTTTTGCGAT CCTGAAATCG GCGCCGTGGG ATGCAAGATC CGCTACCCCG ATGGCACGAT TCAGCATGCC GGTCTCACCC TCGACTGGCC CCTGGCGTTT TCCCGCCATG TTGGGAGGTA CGAGCCGGAT CGTGGGCAGT ACGATGCACC GCGCGATGTC GAGTTCGTCA CGTTTGCGGC GGTTGCCCTG CGACGCCAGG CGCTGGAACG TATTGGATTA TTCGATGAGG GGTATCGCCC CGCTTACTTC GAGGATGTCG ATCTCTGCGC GCGACTGCGA CGTGCTGGCT ACCGCATCCG CTACGAGCCG CGCGCAACCC TGACGCATCG AGAATCCACG TCGCAGCGAG ATGACCTGGT GCGCAGCGCC ATCGCTCACC AGGGTCGTTT ACGCTTTGTG TTGAAAATGT ATCCGTTCGA AGCGATCACC GGCGCATTCG CTGAAGCCGA ACAGGCCTTT CTCGTTCAAC ACAGCAATCC ACCAGAGTGT CGCGCCCTGC GCTGGGCGTA TGACCGAACG CTTGCTGAAA TGACGGAAAT CCTCCATGCC CGGCGCAATT GTGATCCTGA CATGCCTTCG GATACGCTGA TGACGCTGCG CGCATTGTTG CTCGATCTGC GCCATACTCT GGACACGCGC CTGCTCCAAC GGCTCCGCGC GCGAGCGGAA GAAATCAGCG ATCTTGTGAC AGACTATATC GACTCTCTGG CGGTGCGCTA TACGCTGCTG AGCCAACCAC CAACGCGCCT CGACTGGAGC GAGTCGTTTC TGATTGACCT GAAAGTGGAA AATAGCGGAT TCGCTCCCTG GCGCGGCATC GGCGACCACC CGGTCCGATT AGGCTATCAG TGGATCGATC AGGCAGGGAC GCGCCACGCA GGGCGTCACC GGTCTGCAAT TCCACAATCC GTGCATCCTG GAGAGAGCAT TCGTCTTGCG CTGCGCATCG ATCCGCCGCC CGCACCCGGA ATGTGGCGGT TGCAGATCGA ATTGGTCCGA GAATATATCG ACTATTTCAG CACCTATGGC ATCCAACCCC TGTTCCTGAA TATTGAGTAT GTCCTTGAGC CAGCGCCGCG CGCCGTGATT CTTAGTTTTG CCATCGCCGC ACACGACGCG GTCGGCAGCA ATATTCTCGC CCAGGTTCAG GCGCTCCGCC AGACGGGTTA CCGCGTTCTT ATTCTGGCGG AATACGCTGA CGAACGGCTC CCGACTGACA CGCTGCTTGC GACCGTTACG ACCAGACGGA ATCTTCTGCA TGAGCATCCG GCGGTTCTTG AGCACATGCG ACGCGCGGCT GTCATTATTG CGCATTATCC GCTTTACTAC GATCTCGTTG AATTGATCCG CAGCGCGGGC GACAGCGTGG TTATTCTGGA CTATCACGGC ATCACACCGC CGGAAATCTG GGGCATCGAA ACGGTCTATT ATTATTCCAG AATGGTGCGC GGCATCACCA TGCTTTCGCT GGCGCAATAC 
GCCGATTATG CCATTGGGCA TAGTTTCTTG ACGTGCGCTG AACTGATCGC AACCGGGGTC ATTGATCCTG AACGGATCGA GCAGATTCCC TGCCCAATTG CCGCGCATCC GACGCTCGCC GGTCCGCCTG CGCCCGAGAT AGTCGAACAA TTCGGCCTGC GCCATCAGCA CGTGCTCCTC TATGTTGGTC GCATAGCGCG CAGCAAGCGC ATCCACATGC TCGTCCAGGC GCTGCCAATC ATTCTGTCGC GCCACTCCAG GACCATGCTC GTGCTGGTTG GCGACTATCG CCCTTCGATC TACCGGCAGT ATGCCTACGA GATTGAAGCG CACGCGCGCG CACTGGGGGT TGACGCGCAT GTTCGGCTGA CCGGTCAGGT GGACGATGAG ACCCTTGAGC AACTGTACCG CGCCTGTGCC ATGTTCGTCA CCGCCAGTCT CCACGAAGGG TTCTGCATGC CAGTTGCCGA GGCGATGGCG CGCGGTCGTC CGGTAGTTGC AACGCGGGTC GCCGCCCTGC CCGAAACGGT GGGAGACGCC GGATTGCTGT TTGATCCTGA CAATACGGCG GATCTGGCGG CACATGTGTG TCGGTTGCTC GACGATCTGC CTCCTCTCGA AGAAGCGAGC GACCCGCTGG CGGTCTTGCG CCTTGCGCCT GCCACCGAAG AAGATTTTGC CAGGCTGCAC GAGCGCAAAA TCGGCATCGT CACGCCGCGC TATGGCGTAG ATGTGCTGGG AGGCGCAGAA AGCGGAATCC GCGGATGGGC GGAACAACTG GCTGCGCGCG GTTATACTGT TGAGGCGCTG ACCACAACCA CCATTGATAT GGTCAACTGG GGAGATCATA CGTCTCCCGG AGTCGAACAT CTGAATGGTG TCACCATCCG AAGGTTTCAC ACATCCAGTG TGGACAATCG ACCATTCCAT GCCCTGCGGT TGAAGGTTGA CCGCGGCGAA CGCCCGCGCT TCTGCGAAGA AGAACGCTTT ATGGAAAGCA ACCTGCGGAG CGCCGATCTC GAACGCTTTA TCGCTGAACA CGCCGCAGAG TACGCCTGTT TCCTCGTGAC GCCATATCTG TTCGGAACCA GTTACTGGGC AATCCAGCGG GCGCCGGACC GCTCAATTCT CATCCCTTGC CTGCACGACG AACCGCTGGC GCGGCTCAGC ATTGTTCGCC GTATGCTGGA ACAGGCTGCC GCTCTCTTTT TCAATTCGGA AGAAGAGAGC GACTTTGCTC TGTGCGCGCT CGGCGTGGTG AATCCCTATC GCACATGTCT CGGCTTTGGG TTTCCCGACC AACCGGAGCG GGGGGACCCA CTGCGCTTCC GCCAACGAAC CGGAATAACA GGTCCGATGC TCCTCTACGC CGGTCGCCTG GAACCAGGCA AGAATGTTCC GCTCCTGATC GAGTATTTCA TGCGCTACAA GGCAGAACGC CCCGGTCCAT TAACGCTGGC GTTGAGCGGA ACCGGCAGCA TTCCTCTACC ATCGCGCGAT GATATTGTCG GATTGGGGAT GCTGCCGCGT GATGCGCTGA CCGACGCTTA CGCCAGCGCA GTTGCGCTTT GTCAACTCTC GCTGAATGAG AGTTTCTCGC TGGTGCTGAT GGAATCGTGG CTTCAGAGCC GTCCGGTCAT TGTGCATGCG GATTGCGCCG TGACTCGCGG GCATGTTGAA CGCAGCGGCG GTGGATATGC TATTGGCTCC TATGAGGAGT TTCGCGCAGC GGTGGATGCT CTGTTGGCGG ACGAAACAGC AGGCGCAGCA CGTGGTGAGC GTGGAAAAGC CTATGTGCTC GAACGCTACA 
CCTGGAGCCG ATTGCTGCCG CGCATTGAGG AGAGCATCGC CCGCTTCAGC CGCCCGCGTC CGTTGTATGC GCGCCTGGCA CAGCGTAGCA TCGCGCGCGC CCTGTCGTTC ACACGGCAAC GCTACGAAGA TGACTTTCTG GACCTGATCG AGCGCGCAGT TGCGGCGGCA CAGGCGCGCA AGCAGGAGGG ATGA
|
Protein sequence | MSSAVIVLTW NGGAEAIACL QRVRQLNPAP DMVLVVDNDS RDGTPEQIAA LFPDITLIRN AQNLGYAGGM NIGIRALLAH ESPPDIIVLL NQDTLVDREW LGAITAPFCD PEIGAVGCKI RYPDGTIQHA GLTLDWPLAF SRHVGRYEPD RGQYDAPRDV EFVTFAAVAL RRQALERIGL FDEGYRPAYF EDVDLCARLR RAGYRIRYEP RATLTHREST SQRDDLVRSA IAHQGRLRFV LKMYPFEAIT GAFAEAEQAF LVQHSNPPEC RALRWAYDRT LAEMTEILHA RRNCDPDMPS DTLMTLRALL LDLRHTLDTR LLQRLRARAE EISDLVTDYI DSLAVRYTLL SQPPTRLDWS ESFLIDLKVE NSGFAPWRGI GDHPVRLGYQ WIDQAGTRHA GRHRSAIPQS VHPGESIRLA LRIDPPPAPG MWRLQIELVR EYIDYFSTYG IQPLFLNIEY VLEPAPRAVI LSFAIAAHDA VGSNILAQVQ ALRQTGYRVL ILAEYADERL PTDTLLATVT TRRNLLHEHP AVLEHMRRAA VIIAHYPLYY DLVELIRSAG DSVVILDYHG ITPPEIWGIE TVYYYSRMVR GITMLSLAQY ADYAIGHSFL TCAELIATGV IDPERIEQIP CPIAAHPTLA GPPAPEIVEQ FGLRHQHVLL YVGRIARSKR IHMLVQALPI ILSRHSRTML VLVGDYRPSI YRQYAYEIEA HARALGVDAH VRLTGQVDDE TLEQLYRACA MFVTASLHEG FCMPVAEAMA RGRPVVATRV AALPETVGDA GLLFDPDNTA DLAAHVCRLL DDLPPLEEAS DPLAVLRLAP ATEEDFARLH ERKIGIVTPR YGVDVLGGAE SGIRGWAEQL AARGYTVEAL TTTTIDMVNW GDHTSPGVEH LNGVTIRRFH TSSVDNRPFH ALRLKVDRGE RPRFCEEERF MESNLRSADL ERFIAEHAAE YACFLVTPYL FGTSYWAIQR APDRSILIPC LHDEPLARLS IVRRMLEQAA ALFFNSEEES DFALCALGVV NPYRTCLGFG FPDQPERGDP LRFRQRTGIT GPMLLYAGRL EPGKNVPLLI EYFMRYKAER PGPLTLALSG TGSIPLPSRD DIVGLGMLPR DALTDAYASA VALCQLSLNE SFSLVLMESW LQSRPVIVHA DCAVTRGHVE RSGGGYAIGS YEEFRAAVDA LLADETAGAA RGERGKAYVL ERYTWSRLLP RIEESIARFS RPRPLYARLA QRSIARALSF TRQRYEDDFL DLIERAVAAA QARKQEG
|
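The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so a single 64-letter lookup string in TCAG order is enough. The function name `translate` and the two excerpts are chosen here for illustration. The excerpts are the first 30 and last 24 bases of the gene sequence above, with the spacing removed.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order ("*" marks a stop codon).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored (hypothetical helper)."""
    seq = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First 30 bases and last 24 bases of the gene sequence in this record.
head = translate("ATGAGTAGCG CCGTAATCGT ACTGACATGG")
tail = translate("CAGGCGCGCA AGCAGGAGGG ATGA")
print(head, tail)
```

The first excerpt gives the opening residues of the protein sequence (MSSAVIVLTW). The second gives its closing residues (QARKQEG) followed by the stop codon TGA, which is why the protein is one residue shorter than the codon count.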