Gene Information |
Locus tag | Rcas_3864 |
Symbol | |
ID | 5541368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5050359 |
End bp | 5052335 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895973 |
Product | hypothetical protein |
Protein accession | YP_001433918 |
Protein GI | 156743789 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
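The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1977 bp) and that the CDS is unspliced and ends in a stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Rcas_3864.
length = gene_length_bp(5050359, 5052335)
print(length, protein_length_aa(length))  # prints "1977 658"
```

Both results match the Gene Length and Protein Length fields in the record.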
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0913567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.365834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGATC TTCCGATTCG CCGTATGCTG GTGTACCGGC ATGGCGTTGG TTACTTCGAG CGACGGGGTC CAATCACCGG TACAGAACTG CGTCTGACGT TCCCACGGGA AGCCATGGAC GACATTCTGA AAAGCCTGAT CGTCCTCGAT CTGGGTGAAG GTCAGGTTTT GGGTGTCGAT ATCGAAACGC CGGAGGATCG CGCAAAGCAG ATCGAGCGCG GCTCCATCCA CCTTTCCAAT ACGCGCAGTC TGCTCGATCT GCTGCGGGAT TTGCGCGGAC GCAACGTGCG CCTTCAGATT GAGCACGAGC GCCGCCATGG TGATGCCGCC GATGAGGTAA TCGAAGGCGC AATCATCGGT ATTGATCTCG ATGAGCAAGA ACCGCTCGAC AAACCGCTGC TTTCGCTCTA CCTGTCGAAA CAGCGCAACG TACGCACCAT ACCGGTGCGG CGCATCGCGC ACCTGGTAAT TCTCGATGAC CGTGCGGCAG CCGATATGGC GTATTTCCTG CGCGCGGCAC AGAGCGAGGA AGATCGACGA TCCGCCATTG TGCGACTGTC GGAGGGCGAC CACGACATGC TGGTGGGATA CATCGCTCCT GCCCCCTCCT GGCGCGTCAG TTATCGCCTG CTGGCGGAGC CGAAACCCGA CGGCAACGAC TCCGCCAATG GCGGTGGACG TAGCGCAGGT GCGCAAGTGG CGGTTCTCCT TCAGGGGTGG GGACTGTTCG ATAATCAACT CGACGAAGAC CTCGAGAGCA TCGAGTTGAC GCTGGTCGCC GGCATGCCAG TGTCGTTCCG CTATCGCCTG TATGAACCCC ACACGCCTGA ACGACCCATG GTACAGGACG ATGTGCGCAC GGTGGCGGCG CCGATTGAGT TCCAGGCAAA CCGAGCGATA CCAAGCTTAA TGGAAGTCGC TCCGGATTTG GATGAATTTG CACTCGGCGA GGCGTCCGCA CTCAGGATGG AGAATCTCGA ACAGTCCATT GAAGCCGCCG GCGTCGGTGA AGAACGTGGC GCTCTCTTTC AGTACCGTGT TGTGCATCCT GTCAGTGTGG CGCGGGGACG ATCCGCCATG GTTCCGATTG TCAGCCGACG CCTCGATGGA CGCAAAGAAT TGCTCTACAA CGGTCGCAAA CTGCCTCGCC ACCCGGTTGC AAGCCTGCGC ATGCAAAACG AAACCGGGCT GACGCTCGAA CGCGGACCGG TGACGGTCGT CGAGCATGGC GACTACGCCG GTGAAGCCGT GTTGCCCTTC ACGCGCGCAG GGGCTGAAAT GATTATCGCG TATGCGGTCG AACTTGGGGT GACGATCAGC GAAGAACGCC ATCATCAGCG CACAATGGCG GGGTTGAGCA TCCACAAAGA GTATGCGGTG TTTGAGGAAT GGGATGTCCA GCAGATGCGC TACCGCATCA CCAGCACCCT GCCCGACGCC GTAAACATTG TGATCGAACA GGAACGGTTG AAGGGCTACG ACCTGTTCGA CACACCCGCT CCAGACGAAG AGGCGCACAA TGTCGCGCGC TGGACAGTGC GGTGCCCGTC TGGCGTCGAA ACCGTTTTCA TGGTCAACGA ACGCTGCAAG AGATCACGCC ACGAGGAGGT GCGCAAACTC GATATGCACC GGTTGCAGTC GTTCCTGAGC GACAGATATC TCGATCAGGC GACCTACCGA GCGCTCGAGC GCATTCTGTC GCTGTATGAT CAGGTTGCGA AGCGCCGTGC AACGCTTCAA GAGATTGCCC AGGAGCAGCA GAAAATCCTG GCGCGCCAGC AGCAGATCCA GGCAAACCTG 
GGACCGTTGG GGCGCGAGGG GAGTGAACGG GCATTGCGCG AACGGTATGT CGCGCAGCTC AATCAACTCG AAGATCGCCT GAATGACCAG CTTGCCCGCG AGCAGGAGAC CCGCAAGGCA ATCGAGCGAC TGGAGCAGGA AGCAGCACAG GCGCTTGCAG CATTGTCGAA GCCATAA
|
Protein sequence | MPDLPIRRML VYRHGVGYFE RRGPITGTEL RLTFPREAMD DILKSLIVLD LGEGQVLGVD IETPEDRAKQ IERGSIHLSN TRSLLDLLRD LRGRNVRLQI EHERRHGDAA DEVIEGAIIG IDLDEQEPLD KPLLSLYLSK QRNVRTIPVR RIAHLVILDD RAAADMAYFL RAAQSEEDRR SAIVRLSEGD HDMLVGYIAP APSWRVSYRL LAEPKPDGND SANGGGRSAG AQVAVLLQGW GLFDNQLDED LESIELTLVA GMPVSFRYRL YEPHTPERPM VQDDVRTVAA PIEFQANRAI PSLMEVAPDL DEFALGEASA LRMENLEQSI EAAGVGEERG ALFQYRVVHP VSVARGRSAM VPIVSRRLDG RKELLYNGRK LPRHPVASLR MQNETGLTLE RGPVTVVEHG DYAGEAVLPF TRAGAEMIIA YAVELGVTIS EERHHQRTMA GLSIHKEYAV FEEWDVQQMR YRITSTLPDA VNIVIEQERL KGYDLFDTPA PDEEAHNVAR WTVRCPSGVE TVFMVNERCK RSRHEEVRKL DMHRLQSFLS DRYLDQATYR ALERILSLYD QVAKRRATLQ EIAQEQQKIL ARQQQIQANL GPLGREGSER ALRERYVAQL NQLEDRLNDQ LAREQETRKA IERLEQEAAQ ALAALSKP
|
| |
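The sequence fields are displayed in blocks of ten characters separated by spaces, so they need cleaning before any downstream use. The sketch below shows one possible way to strip the whitespace and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and whether the database rounds its figure the same way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Hypothetical usage on the first two blocks of the gene sequence field.
fragment = clean_sequence("ATGCCCGATC TTCCGATTCG")
print(len(fragment), gc_percent(fragment))
```

Applied to the full gene sequence field, the cleaned string length can be compared with the Gene Length field, and the percentage with the GC content field.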