Gene Rcas_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3547 
Symbol 
ID5541048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4625989 
End bp4627365 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content48% 
IMG OID640895666 
Producthypothetical protein 
Protein accessionYP_001433614 
Protein GI156743485 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0722197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATAG TAACGGATGA GTTGTTGGTG GAGGCGTTGC TTGCGATTGA AGAGACCAAT 
ACACCAATGA CGTTTAGCGA GATCGCCAAA AGCATCAGCA AACGACGTGG TGGCGTTCAA
CTGACAGATG ATGAAATAGG GGACTTAAGA GAATGTCTCT ACCAGAACGG GAACATGTAC
ATCCTCTGGT CTGCCAAAGA CCGAACCTGG ACAATCACAC CTGAAGGGCG CGCTTACGCA
CGTCAGATCA GCATTGAGCC TGAAGAGCTC GTGGACATTT CAGAAACTGA TGAAGCGCTT
GAGCCTAAAG GCGCGCCTTT CAATCCAGCA CTGATCAAGG TCGATGTCGA TCAGATGCAT
ATCTATCACG CCCTTAAGCT TATTCGTGAG CAAAGACTCG TTCTTCAGCC AGAATTTCAG
CGCAATTTTA TTTGGGACGA AGTTCGTCAA AGCCGTCTTA TTGAGTCTAT TCTTCTTCGT
ATCGCGCTGC CAGCATTTTA TCTTGATGCT CCGAGGGAAG ACACCTACGT CGTCATTGAT
GGTCTCCAAC GTCTGAAAAC TCTCGACCGC TTTTGCAATG AAAAATCGCT CAAACTAACT
GGTTTGGAAT ACCTGCGCGA GTTTGAGAAT CACGGGTTTA GCGATCTCCC TTCCCATATG
CGATCCCGCC TTGAAGAAAC ACGCCTCACG ATGCACATCA TTCGACCAGA AACCCCTTTG
CAGGTCAAAT TCATTATCTT TCGCCGTATC AATACTGGCG GTTTAGTCTT GACCAATCAG
GAGATCAGAC ATGCGCTCTA TCAAGGAAAT GATGGACGCG CTTCTCGTTT GCTCAAGAAC
CTCGCTGAAA GCCCGGAATT TCTCGATGCG ACTGATCGTT CGATCAGTCC GCGACGGATG
GATGATCGTG AATGTGTTCT GCGTTTTCTT ACGTTTGTGC GTTATCCATA CGAGCAGTTC
GGTCGAAACA TGAGCGTTGG CGAACCGCCA AACCTTGATG GATTGCTGAA TCGTACCATG
GCAGACCTGA ATGCACTGCC CTTTGAGGAA CATGATAGGC TCAAAGAGGT GTTCCGCGAT
AGCATGTGTA AGGCGCATCT CGTATTCGGT CGCCATGCTT TTCGCAAGAT ATACGGACGC
AATCAGAAAC GTCAACCGAT TAGCAAACCG CTTTTCGAGG TCTGGAGCAC ACTGCTCCGC
GATTGGCCAA TCGAAATTCT GGAACAGCGC CGTGAACAAT TAATCGATGG TTTTATCGAA
ATTATGCAAC ATGATTTTGA CTTCATCAAG TCTATCTCAT ATGGTACAGG AAGCGTAAGG
GCAGTTAAGT ATCGCTTTGA CCGAATCAAT AGAATGCTTC GAGAAACTCT GCGATGA
 
Protein sequence
MSIVTDELLV EALLAIEETN TPMTFSEIAK SISKRRGGVQ LTDDEIGDLR ECLYQNGNMY 
ILWSAKDRTW TITPEGRAYA RQISIEPEEL VDISETDEAL EPKGAPFNPA LIKVDVDQMH
IYHALKLIRE QRLVLQPEFQ RNFIWDEVRQ SRLIESILLR IALPAFYLDA PREDTYVVID
GLQRLKTLDR FCNEKSLKLT GLEYLREFEN HGFSDLPSHM RSRLEETRLT MHIIRPETPL
QVKFIIFRRI NTGGLVLTNQ EIRHALYQGN DGRASRLLKN LAESPEFLDA TDRSISPRRM
DDRECVLRFL TFVRYPYEQF GRNMSVGEPP NLDGLLNRTM ADLNALPFEE HDRLKEVFRD
SMCKAHLVFG RHAFRKIYGR NQKRQPISKP LFEVWSTLLR DWPIEILEQR REQLIDGFIE
IMQHDFDFIK SISYGTGSVR AVKYRFDRIN RMLRETLR