Gene Rcas_0411 details

Gene Information

Locus tag: Rcas_0411
Symbol:
ID: 5537873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 518610
End bp: 519761
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 60%
IMG OID: 640892573
Product: transposase IS4 family protein
Protein accession: YP_001430560
Protein GI: 156740431
COG category:
COG ID:
TIGRFAM ID:
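
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 518610
end_bp = 519761
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # prints "1152 384 383"
```

Under these assumptions both derived values agree with the listed gene length (1152 bp) and protein length (383 aa).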


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.215728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCT GTGATGAATT GCACTCGAAA ATGGAAGAAC CCATTCGTCC GCTTGTCGAC 
GTCAAACACG CCCATCACCT CTCGAATTGG TTGTGGATCG TCTGCGGGAT TCTGCTCAGT
GGTTCGGTCG CATTAAGCAA GATTGCCCTT TACTTGCCCC TCACGGCGCA AGCGGAAGGG
CGGATTGCTC GCATTCGCCG TTGGTTGAAG AATGTGTACG TCGATGTCTG GCAATTCTAC
CGTCCCCTGC TGGAGAAGGT GTTGCAAGGA TGGCAAGCCG CAGAGGCGGC GGTGATCCTG
GACGGCGTCA TGGTCTTCGG CGACCGCTTG CAGATTTTCC GTCTCTCCTT GCGACATGGC
AGTCGTGCCA TCCCTTTGTC CTGGGTGGTC GTGCCGGGCA AAGGGCTGAC CAGCGTCGAA
CGGTTGCGTC CATTGATCCA GCGGGCCGCA GAGTTCCTGG CCCCACGCGT CGGCGCCGTT
GTGTTTCCGG CCGATCGCGG CTTTCGCGAT GTAGAATGGG CCGCTCTGTG TCTGGAAGTC
GGTTGGCATT ATGTCATCCG ACTGGCCAAC AACACCCTCA TCACTCTGGA GGATGGCCGC
CGCCTGTCCA TCGCGGCGCC GGGTGTGCCG CCTGGGGAAG CCTGCTATTG GCGGAACGCG
GCCATCACCC AGTCGCAGGA CTGGCCTGCC AACCTCTCTG TAACCTGGAC CAAAGGCGCA
CGCGGTCAGG CGCCGGAATT GCTGGCCGTG ATGAGCGATC GGCGCGCCTG CAACCAGCGG
TTGCGTGAAT ATGGTTGGCG TATGAGCATT GAAGAGAGTT TCCGCGATGA CAAATCGGGC
GGCTTTGACC TGGAACATAC ACGCCTCCAA GACCCGCAAC GGCTAGAACG CCTGTTGTTG
GCGGTCGCAA TCGCCACACT TTGGCGTCAC GAATTGGGCG AGCAGGCGCT CCACGATCAC
AGCGTCCAGG CCGAACTCGA CCCAGGCGGC AAGCGCCGCG AACTCAGCAT CTTCCAACTA
GGCCTGCGAT TCCTCAGGCG GTGTTTGTTA GCGCTTACCA CCGCGCGCTT ACCCAAATTG
CGGTTGGTTT TGTCCAACTT GGCGCTCGAG CCCCTCTCAC CAAGACACCT GGCCAAGGAG
AAATGTCAGT GA
 
Protein sequence
MSACDELHSK MEEPIRPLVD VKHAHHLSNW LWIVCGILLS GSVALSKIAL YLPLTAQAEG 
RIARIRRWLK NVYVDVWQFY RPLLEKVLQG WQAAEAAVIL DGVMVFGDRL QIFRLSLRHG
SRAIPLSWVV VPGKGLTSVE RLRPLIQRAA EFLAPRVGAV VFPADRGFRD VEWAALCLEV
GWHYVIRLAN NTLITLEDGR RLSIAAPGVP PGEACYWRNA AITQSQDWPA NLSVTWTKGA
RGQAPELLAV MSDRRACNQR LREYGWRMSI EESFRDDKSG GFDLEHTRLQ DPQRLERLLL
AVAIATLWRH ELGEQALHDH SVQAELDPGG KRRELSIFQL GLRFLRRCLL ALTTARLPKL
RLVLSNLALE PLSPRHLAKE KCQ
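
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, assuming the amino-acid assignments of translation table 11 (which match the standard code for sense and stop codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Assumption: amino-acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence listed above.
first_block = "ATGAGCGCCT GTGATGAATT GCACTCGAAA"
last_block = "AAATGTCAGT GA"

print(translate(first_block))  # MSACDELHSK
print(translate(last_block))   # KCQ
```

The two results match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.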