Gene Rcas_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1148 
Symbol 
ID5538614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1485289 
End bp1486959 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content61% 
IMG OID640893280 
Producthypothetical protein 
Protein accessionYP_001431263 
Protein GI156741134 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.591181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTACG CCCATGAACA GACCGAAACG TGGAACACGA CCGCCGAGCA GATTGATGAT 
ATTCCGATTC TTATTGCGCA TATGCGCAGA ATGGGCTTGC CTCAGGCGCT TGATCGTCAC
ATTCCCACAC GCGCCTACTG GGGCAACCTC AGCGTTGGTT GGACTGCGGC GGTCTGGTTG
ACGCATCTGC TGTCGTGCAG CGATCACAAA CCCGCCCATG TGCAGCAGTG GGTGGAGACC
CATATCGCGG CGCTGCGCTG GTGCATCGGC AGCGAGATAA CGCATGCCGA CGTGGGCATT
GATCGGCTCC ACGATGTGCT GATCGGACTC AGCCAGGACG ACCAATGGCA GGCAATCGAG
ACCGATCTGA ACCGTCGTAT GCTCCGCGCC TGGCCATGGA CCTCTCGCCA GGTGAATCTG
CGCCTGTACG AAGGGCGTTC GTGGTTTGTT GCGCCGAGCG GCGCCTTTCA GATCGCCAGA
GTGCATCCCT GGCGCGCCAG AACGTTGCGC CAGTCAATTG TACTGGCAAC GATCCGTGCG
TCGAATCTGC CGTTCGTAAC GTGGTCGTTC CCCGAGGATC ATGTTCCACC TGTATTGTTC
GCCAGGATAC TGGAACGGAT CTCACAGGAC CTGCCCTCAC AACGGCTTCG ATTTATTGGT
GACTCGCTGT TCGCTCCAGG ACTCCGGGGT GCGGTTCATA TGCGCAACGA TGAGTACCTC
TGCCCGCTCC CCGATACGCA TCCCGATTCT CTCAATCCGC TCACAGCCCA TTTTGCTGCG
CATACGCCCG CATGCGCGGC GGGGCGAAAT GGCAATCCTC ACCATCTCAC TGCTGACGAC
AGTATCGAAT GGTACGCGCC GGTCAGTGTC GAGATCGATG GCGCGACCGT GGCGTGGAAC
GAGCGACGCA TTGCTGTGCG TTCACCGGTG CAGGCGCATC GGCTGGAAGA AGCGCTCCGC
ACCCGGTTAG TGCGCGCAGA GGCGGCATTG CTTGCGCTCG TCGAGCGTAA GCGTGGGAAA
CGTCGTCCGC GTTCCCTTGA AGCGTTGCGC GAAGCTGCCC ACGCCATCCT CGACAGTTAT
CAAGTTCATG GGTTGCTGCG CCTGGATTTT GCCGAGCAGG TCCAGGAGCG ACTTGTCCGC
CGGTACCGCG GACGCCCAAC GGGCATGCGC GTCGAGCGTG ATGTGGGCCT GAACGTTTCG
ACCGATGCCG ACGCGCTTGC GCAGGCGATC CGACGCCTGG GCTGGCAAAC CTTTGTGTCC
AACATTGCTC CGCACGACCT GTCCGCCGAC CGTATCCTTG CCATCGCCGC TCCGGTCTCT
GGATTCGAGC GTTTGAACGG TCGTCCGCTC TCGTTGGCGC CACATGAAGT GCATACTCCC
GAACTGGAGA CCGGACTGGT GCGTCTGCTC GCTCTGGGGC TGCGCACTCT CGCACTGCTG
GAAACGATTG CGCGCGACCA ATTGATCAAG GAAGAGGTGC TGTCCGCTTC GGACAGCGAA
CGTGCGGCAT CGCGCACCAC CGGCGAGCGA TTGCTCGACG CCTTCCAGGA TATTATGCTC
ACACCCGGCA TCAATCAGCG TCTAGGCGCG ATCACGCCGC TTTCACCGTT GCAACAGCGG
GTGCTGCATC TGGTGGCGTT GTCGCCGGAT ATCTACCGGA TGCCCGGGTA A
 
Protein sequence
MPYAHEQTET WNTTAEQIDD IPILIAHMRR MGLPQALDRH IPTRAYWGNL SVGWTAAVWL 
THLLSCSDHK PAHVQQWVET HIAALRWCIG SEITHADVGI DRLHDVLIGL SQDDQWQAIE
TDLNRRMLRA WPWTSRQVNL RLYEGRSWFV APSGAFQIAR VHPWRARTLR QSIVLATIRA
SNLPFVTWSF PEDHVPPVLF ARILERISQD LPSQRLRFIG DSLFAPGLRG AVHMRNDEYL
CPLPDTHPDS LNPLTAHFAA HTPACAAGRN GNPHHLTADD SIEWYAPVSV EIDGATVAWN
ERRIAVRSPV QAHRLEEALR TRLVRAEAAL LALVERKRGK RRPRSLEALR EAAHAILDSY
QVHGLLRLDF AEQVQERLVR RYRGRPTGMR VERDVGLNVS TDADALAQAI RRLGWQTFVS
NIAPHDLSAD RILAIAAPVS GFERLNGRPL SLAPHEVHTP ELETGLVRLL ALGLRTLALL
ETIARDQLIK EEVLSASDSE RAASRTTGER LLDAFQDIML TPGINQRLGA ITPLSPLQQR
VLHLVALSPD IYRMPG