Gene Rcas_1380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1380 
Symbol 
ID5538853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1765543 
End bp1766652 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID640893518 
Producthypothetical protein 
Protein accessionYP_001431494 
Protein GI156741365 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02877] sporulation protein YhbH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATTC ACCGCGCTGA GCGTGATTTG AACCGGTTCC GCCAGATTGT ACGCGGCAAG 
ATCAAAAAAG ACCTGCGCAA GTACATGTCG CAGGGCGAAA TGATTGGACG CCAGGGGCGC
AAATATGTGT CGATCCCGCT GCCGCAGATC GATCTGCCGC AGTTCCGCTA TGGCACACGC
CAGAGTGGCG GGGTTGGGCA GGGCGATGGC AACGTTGGCG ACCCAATTGG GCAGGGTGAT
GGCCAGTCGG GGCAGGGCGA GGCTGGCTCG GAACCAGGGC AGCACGTGAT CGAGGTCGAT
GTTACCATCG AAGAATTGGC GCAAATCCTT GGCGAAGAAC TGCAATTGCC CAACATCCAG
CCCAAGGGCA AAAAGAATAT TGTCTCGCAG AAGGATCGGT ATTCCGGCAT TCGCCGTGTT
GGTCCCGACT CGCTGCGGCA TTTCAAACGC ACCTATCGCG AAGCGCTCAA ACGCCAGATT
TCGTCGGGTG AGTACAACCT CGCCGATCCG ATCGTCGTGC CGATCCGCCA GGATATGCGC
TATCGCTCCT GGAAGGAAAC GCTTCAGCCA GAGTCGAACG CGGTCATTAT CTACATGATG
GACGTGAGCG GCTCGATGGG CGCTGAGCAA AAGGAGTTGG TGCGCATCAC GGCATTCTGG
ATTGAAACCT GGCTGCGCTC GCAGTACAAG GCGATCGATA TTCGCTATAT CGTTCACGAT
GCTGCTGCAA AAGAGGTCGA TCAGGAGACG TTCTACCACA TCCGCGAAGG CGGCGGCACC
AAGATCAGTT CGGCGTACAA ATTGTGCAAT AAACTGATCG ATGAGCGCTA CCCGGCTGAT
GAGTGGAATA TCTATCCGTT CCACTTCTCC GATGGCGACA ACTGGGGTGG CGGCGATACG
CGCGAGTGCA TCGAATTGCT GCGCACCCAA CTTCTTCCCA AGGTCAATCA GTTCTGCTAT
GGTCAGGTGC GTTCGCTCTA CGGCTCGGGG CGCTTTGCGC ACGACCTCGA AGAGCACCTG
GGCAAGCATG AGGCGCTGGT GATCTCGGAG ATTGCCGATC GCGACGATAT CTACGATGCG
ATCAAGGATT TTCTTGGCAA GGGTCGGTAG
 
Protein sequence
MSIHRAERDL NRFRQIVRGK IKKDLRKYMS QGEMIGRQGR KYVSIPLPQI DLPQFRYGTR 
QSGGVGQGDG NVGDPIGQGD GQSGQGEAGS EPGQHVIEVD VTIEELAQIL GEELQLPNIQ
PKGKKNIVSQ KDRYSGIRRV GPDSLRHFKR TYREALKRQI SSGEYNLADP IVVPIRQDMR
YRSWKETLQP ESNAVIIYMM DVSGSMGAEQ KELVRITAFW IETWLRSQYK AIDIRYIVHD
AAAKEVDQET FYHIREGGGT KISSAYKLCN KLIDERYPAD EWNIYPFHFS DGDNWGGGDT
RECIELLRTQ LLPKVNQFCY GQVRSLYGSG RFAHDLEEHL GKHEALVISE IADRDDIYDA
IKDFLGKGR