Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1148 |
Symbol | |
ID | 5538614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1485289 |
End bp | 1486959 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893280 |
Product | hypothetical protein |
Protein accession | YP_001431263 |
Protein GI | 156741134 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.591181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTACG CCCATGAACA GACCGAAACG TGGAACACGA CCGCCGAGCA GATTGATGAT ATTCCGATTC TTATTGCGCA TATGCGCAGA ATGGGCTTGC CTCAGGCGCT TGATCGTCAC ATTCCCACAC GCGCCTACTG GGGCAACCTC AGCGTTGGTT GGACTGCGGC GGTCTGGTTG ACGCATCTGC TGTCGTGCAG CGATCACAAA CCCGCCCATG TGCAGCAGTG GGTGGAGACC CATATCGCGG CGCTGCGCTG GTGCATCGGC AGCGAGATAA CGCATGCCGA CGTGGGCATT GATCGGCTCC ACGATGTGCT GATCGGACTC AGCCAGGACG ACCAATGGCA GGCAATCGAG ACCGATCTGA ACCGTCGTAT GCTCCGCGCC TGGCCATGGA CCTCTCGCCA GGTGAATCTG CGCCTGTACG AAGGGCGTTC GTGGTTTGTT GCGCCGAGCG GCGCCTTTCA GATCGCCAGA GTGCATCCCT GGCGCGCCAG AACGTTGCGC CAGTCAATTG TACTGGCAAC GATCCGTGCG TCGAATCTGC CGTTCGTAAC GTGGTCGTTC CCCGAGGATC ATGTTCCACC TGTATTGTTC GCCAGGATAC TGGAACGGAT CTCACAGGAC CTGCCCTCAC AACGGCTTCG ATTTATTGGT GACTCGCTGT TCGCTCCAGG ACTCCGGGGT GCGGTTCATA TGCGCAACGA TGAGTACCTC TGCCCGCTCC CCGATACGCA TCCCGATTCT CTCAATCCGC TCACAGCCCA TTTTGCTGCG CATACGCCCG CATGCGCGGC GGGGCGAAAT GGCAATCCTC ACCATCTCAC TGCTGACGAC AGTATCGAAT GGTACGCGCC GGTCAGTGTC GAGATCGATG GCGCGACCGT GGCGTGGAAC GAGCGACGCA TTGCTGTGCG TTCACCGGTG CAGGCGCATC GGCTGGAAGA AGCGCTCCGC ACCCGGTTAG TGCGCGCAGA GGCGGCATTG CTTGCGCTCG TCGAGCGTAA GCGTGGGAAA CGTCGTCCGC GTTCCCTTGA AGCGTTGCGC GAAGCTGCCC ACGCCATCCT CGACAGTTAT CAAGTTCATG GGTTGCTGCG CCTGGATTTT GCCGAGCAGG TCCAGGAGCG ACTTGTCCGC CGGTACCGCG GACGCCCAAC GGGCATGCGC GTCGAGCGTG ATGTGGGCCT GAACGTTTCG ACCGATGCCG ACGCGCTTGC GCAGGCGATC CGACGCCTGG GCTGGCAAAC CTTTGTGTCC AACATTGCTC CGCACGACCT GTCCGCCGAC CGTATCCTTG CCATCGCCGC TCCGGTCTCT GGATTCGAGC GTTTGAACGG TCGTCCGCTC TCGTTGGCGC CACATGAAGT GCATACTCCC GAACTGGAGA CCGGACTGGT GCGTCTGCTC GCTCTGGGGC TGCGCACTCT CGCACTGCTG GAAACGATTG CGCGCGACCA ATTGATCAAG GAAGAGGTGC TGTCCGCTTC GGACAGCGAA CGTGCGGCAT CGCGCACCAC CGGCGAGCGA TTGCTCGACG CCTTCCAGGA TATTATGCTC ACACCCGGCA TCAATCAGCG TCTAGGCGCG ATCACGCCGC TTTCACCGTT GCAACAGCGG GTGCTGCATC TGGTGGCGTT GTCGCCGGAT ATCTACCGGA TGCCCGGGTA A
|
Protein sequence | MPYAHEQTET WNTTAEQIDD IPILIAHMRR MGLPQALDRH IPTRAYWGNL SVGWTAAVWL THLLSCSDHK PAHVQQWVET HIAALRWCIG SEITHADVGI DRLHDVLIGL SQDDQWQAIE TDLNRRMLRA WPWTSRQVNL RLYEGRSWFV APSGAFQIAR VHPWRARTLR QSIVLATIRA SNLPFVTWSF PEDHVPPVLF ARILERISQD LPSQRLRFIG DSLFAPGLRG AVHMRNDEYL CPLPDTHPDS LNPLTAHFAA HTPACAAGRN GNPHHLTADD SIEWYAPVSV EIDGATVAWN ERRIAVRSPV QAHRLEEALR TRLVRAEAAL LALVERKRGK RRPRSLEALR EAAHAILDSY QVHGLLRLDF AEQVQERLVR RYRGRPTGMR VERDVGLNVS TDADALAQAI RRLGWQTFVS NIAPHDLSAD RILAIAAPVS GFERLNGRPL SLAPHEVHTP ELETGLVRLL ALGLRTLALL ETIARDQLIK EEVLSASDSE RAASRTTGER LLDAFQDIML TPGINQRLGA ITPLSPLQQR VLHLVALSPD IYRMPG
|
| |