Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1728 |
Symbol | |
ID | 5539206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2223509 |
End bp | 2225428 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640893867 |
Product | hypothetical protein |
Protein accession | YP_001431838 |
Protein GI | 156741709 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.633344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGTC TCCCTGATCG TCATCGTTCA CCGCGTTCCG GGCGTGTGCC CGCGCGCTGG ACATTCTATG AAGTGCTGCT GATTGCGTTG AGTGTGGCGC TGATCATCTT CCCGCTCTAT GCAGAAGTCT ATCGCTGGGT GTATCCCGCT TTTGCGCCAG CCGCTCCGCG CGTCCAGGAA GCGCCGACCG ACACGCCAAC CGACGTGCCG TCCGAGGCGC CGACCGACGC GCCCGCTGAG CCGCAGGCGC GCGTGACCGA AACCCCCGAA CCAAGCCCAA CAGCCGGTCC AAGCCCCACG AGCAGCCCAA CGTCGGTGAA CACTCCGACG CCGACCAGTA CGCCAACCGC AACACCGGGC GACCAGACGC CGGTGGCGAC CAATACACCG ACTCCGACCA GCGATGCGGG GGTGACGATC ACGCCAATTC CGACGTTCTC GCCTACCAGA ACGCCAACGT TTACCTCCAC GCCTGGTCCC AGCCCGACGC CGGTGCTCGG CGTCCCGCCG TTGACCCTTT CGAAGTCCGC CTCGGTGCAG TTTGCTTCGC CAGGACAGGA ATTCACCTAT TCGCTGGTGG TTGACACGAA TTCATCGGTT CCGATCCAGA TTGAGGTGCG CGATCCGATT AATGCGCAAC TGGAGGTGAC CGGAACGAGC GTTTCAAACG GGTCGTGCCA GGTAAGCGGC AATACCGTTG TGTGTAATGT TACTGCCGTT GTCAATCAAC CGGTGTCGAT CAATATCAAT GTCCGTGTGC GCACCGGCAT TCAATCCGAA ACTCTTATCG CCAACCAGGC GGCAGCGCAG GACGCGCGCG GTTTCACGGC AGCCTCCGAT TCGGTATCTG TGCGCATTCC GGGCGGCATC ATTCCGCCGA CGCCGTCGCC GGACCCCAAT CGGCCAACGT CGCCGCCAGC AACCCCAACA CCGCCAATTG CTCCTTCGCC AACCCAGCCG TCAACCCCGC CACAACCACC GGCGCCTCCG CCTTCCGGCG GCGATGGCGG CGGAGGAGGC GGCACTCCCC CGGAAGGCGC GCCGCCTCCT CCGCCGGTGA TCGACCTGGT GCTGCCGCCG ACGCCGGCGC CGGTTGCGCC AGCGCCGCAA CAGCCGCAAC AGCCGCGTCC GACTCGCTCC GCCGGTACTG GAGGCGCGCG TACTGCGACG CCAACGGCAT CGCCGACGAC GATTGCTGCC ACCGACGCGA TCTTCTTCCG CATGGCGAGC AACTGGGGGA GTGCCTTCCC CGGTGATGCC GTGACGTATG TGATCGCTGT GCGCAATACG CATCCGTCCA ATTCACTGCG CGATCTGGCG CTGCGCAGTG TGTTTCCGGC AAATCTGGAG ATCACCGGTC TTTCCTCCGG TCCGATCGAT CGCAATACGC CAGGGGCATT CACCCCCGGT GATCCGTCGC GCACCGACAA CCGCATTTCA CTGGGGGTCG CCGAGTTGCC TGCCGGTCAG GGGTTCGAGG TGATCGTCCA GACGAAGATT AAGCCGAATG TGTCGGGCGG GACACGTATC GTGGCGCAGG CGGAATTGAC CTTTGCCGGT CTCGCTATCC CGCTCTATTC GAATATCGTC ACTGTCGAAG TGGTCAACGC GGCGCAGGCG CAGGTTCTGC CGACGGATAC GGCGACCGCA ACGCCAACTG AAGCGCCAAC CGCGATGCCG ACCAGCACGT TGACGCCTGT GCCGGCCACC GAGCCGCCGA CGGTGGTAGT CGAAGCGGGG CAACCGGCGG CAGTCGCGCC AACGGCGACA CCCGGCGCGG CTGGGAGCGT CGTCGGTTCA GCGGGGACAG CGCCGTTGCC GGCAACCAGC ACCGGAGTTC CGCTGGCAGG CTTTGCGCTG CTCGGCGCAA CGCTCCTGGC GCGCACATGG CGGCTCCATC GCGCGAAGTC GCGTATCTGA
|
Protein sequence | MQRLPDRHRS PRSGRVPARW TFYEVLLIAL SVALIIFPLY AEVYRWVYPA FAPAAPRVQE APTDTPTDVP SEAPTDAPAE PQARVTETPE PSPTAGPSPT SSPTSVNTPT PTSTPTATPG DQTPVATNTP TPTSDAGVTI TPIPTFSPTR TPTFTSTPGP SPTPVLGVPP LTLSKSASVQ FASPGQEFTY SLVVDTNSSV PIQIEVRDPI NAQLEVTGTS VSNGSCQVSG NTVVCNVTAV VNQPVSININ VRVRTGIQSE TLIANQAAAQ DARGFTAASD SVSVRIPGGI IPPTPSPDPN RPTSPPATPT PPIAPSPTQP STPPQPPAPP PSGGDGGGGG GTPPEGAPPP PPVIDLVLPP TPAPVAPAPQ QPQQPRPTRS AGTGGARTAT PTASPTTIAA TDAIFFRMAS NWGSAFPGDA VTYVIAVRNT HPSNSLRDLA LRSVFPANLE ITGLSSGPID RNTPGAFTPG DPSRTDNRIS LGVAELPAGQ GFEVIVQTKI KPNVSGGTRI VAQAELTFAG LAIPLYSNIV TVEVVNAAQA QVLPTDTATA TPTEAPTAMP TSTLTPVPAT EPPTVVVEAG QPAAVAPTAT PGAAGSVVGS AGTAPLPATS TGVPLAGFAL LGATLLARTW RLHRAKSRI
|
| |