Gene Rcas_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1678 
Symbol 
ID5539154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2158350 
End bp2160434 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content63% 
IMG OID640893815 
Producthypothetical protein 
Protein accessionYP_001431788 
Protein GI156741659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000109442 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGCAAC ATCTGGTTGA TCGCTCTATC CGCATGGTTT TCGCTCATCT ACGCCGCAAC 
CGCCTGATAG CCGCCGCAAT CTTCACCGCG CTGCTGCTCC TCTACCTGCC GACTCTCACG
CGGGTGCATA CGTTCGATGC GTTGTCGTAT GTGACGAGCG TCGAACGCAA GCCATGGACG
GAGTTGTTCC ACCCGCACCA TCTGGCGTAT GGTCCGCTCG GTTCGTTGAT GCTGACGTTG
AGCCGCGCCC TGGGATACGA TAACGGCGCC GCGTTGCCGA TGCAACTGGT GAATGCTGTT
GCAGGAGCGT TGGGAGCGGC GCTCTTCTAT CTGACCGTAT GCGCGGCAAC CGGACGCGAT
GATGTGGCGC TGCCGGCGAC GTTGTTGTTG GGTGTGGCGT ATGCCTACTG GTACTACGCT
ATCGAGATCG AGGTCTATAC GCTTGCGACG TTCTTTCTGA TCGTCTGTCT GGCGTTGATG
ACCCGCCCAA CACCCTGGAC GGCGCGCCGC TGCCTGGCGT TGGGTATTGC ACAGGGTGGC
GCGGTGCTGT TCCATCAGAC GAATGTGTTG CTCTGTGCGC CGATTGTATT GAGCGCCCTG
TCCGACCTGC GCACTGCCTG GAAACCTGGC AGTGCGCGCG CGGCCGTCGG TCGCTGGAGC
GGGTATGCAG TCGCGCTGGC ATTGATCGTG GCGATGCCCT ACCTCTACGC CATGCTGGTC
GTCAGCAATT TTCGCGCGCC TGCGGAGATG CTGGCATGGC TCACGGAGTA TGCGCGCACC
GGATGGTGGG GCGGTCCGCT GACGGTTGCA ACGATTGCCG ATCTCGGCGC CGGTCTGAGC
GACACGCTGG CGCAACCAGG GGGCGGCTGG TTCTACCTGG CGCTCGGCGG GGTCGTGGCG
TGGGCGTGGG GCAGGCATTC CCAAATCCCG ACCGGCGAGC AGGATGATGC ACAGGGAAAC
CGGGTCGCGC TCACGCCGCT GATCGCCTGG CTCGCAACGT ATGGCGCATT TTTCGCCTGG
TGGGAACCGG ATAATATCGA GTTCTGGATC GCCAGTCTGC CGCCGGCATG CCTGGTGTTT
GCCGCAGCGC TCGCGCGGGC GCGCTGGTGG AGCGCACCGG TCTGGACATC CCTGGCAATT
GTTGGAGTCA TTGCGTGGGG CAATTATGGT TCTATCGTCC GGCGTGGCGA TCCGTCAACT
GACCTGCAAC GCCTGATTGC GCGTGAACTG GCAGCGCGCA GCACACCGGC CGATCTGTTG
ATTGTTCCTG ATGGATTGCA GGAACTCTAC CTGCCCTACT ATGAGCGCCG CGAGAACTTT
CTGTCGCTCA ACCAGGCGCT GTTCGACAGC GGCGGATCAT GGGATGACGC CTGTGCGACG
ATCCGCAGCC GGATTGCCAC GGCGCAACGC GCCGGCGCGG CGGTGCTGAT CGCCGATGAA
GCGCTCCATC CGCCGCCACG GGTGCTGAAC CGCCACGGCA TCGCGCAAGC GCAGGTGGAT
GGATGCTTTG CGCCATATCA CGCAACGCTG GTCGATCTTG CGTTGGCCGA ACCGCTTCCC
TCCTACCGGC GGCTGCCTTC CGCAACTGAA ACGGCGCTGG CGGGCGGATG GGTCTTCGAG
CGCGATGCGA TGGGCTGGCA GGCGTTTAAC GTCGCGGGTG AACGTCTTGA TGGGGGATGG
CGCTTCCGTC CCGGCATCGA CCCGGCGCTG CTCAGTCCGC TCTTGAACCT CGATGCGCGC
GAGGTGACTG CCATCGAGAT TCGGATGGCG AACGGCACGC GCGCCCGCGA CGCGCAACTC
TTCTATGCCG GAATCGATGG CGCGTTGACC GAGGAATATT CCGTGCGCTG GCAACTGGCA
GAGACAGCCG AAGCCGTAAC CTACGTCATC GATCTCCGGG AAGCGTCGGG ATGGCACGGC
ATAATCACCC GCCTGCGGTT CGATCCGGTT GGGATGGGTG ACGACGGCGA GATTCGCGTC
GAATGGGTGC GTCTGCGCCT GGCGACGCCA TCGGGGAAGC AGGCGCTCGC CAGTGTCGGT
CGCCTAAGCT CGACTTTATC CATCTGTGCC GCTCACGCAG CATAG
 
Protein sequence
MVQHLVDRSI RMVFAHLRRN RLIAAAIFTA LLLLYLPTLT RVHTFDALSY VTSVERKPWT 
ELFHPHHLAY GPLGSLMLTL SRALGYDNGA ALPMQLVNAV AGALGAALFY LTVCAATGRD
DVALPATLLL GVAYAYWYYA IEIEVYTLAT FFLIVCLALM TRPTPWTARR CLALGIAQGG
AVLFHQTNVL LCAPIVLSAL SDLRTAWKPG SARAAVGRWS GYAVALALIV AMPYLYAMLV
VSNFRAPAEM LAWLTEYART GWWGGPLTVA TIADLGAGLS DTLAQPGGGW FYLALGGVVA
WAWGRHSQIP TGEQDDAQGN RVALTPLIAW LATYGAFFAW WEPDNIEFWI ASLPPACLVF
AAALARARWW SAPVWTSLAI VGVIAWGNYG SIVRRGDPST DLQRLIAREL AARSTPADLL
IVPDGLQELY LPYYERRENF LSLNQALFDS GGSWDDACAT IRSRIATAQR AGAAVLIADE
ALHPPPRVLN RHGIAQAQVD GCFAPYHATL VDLALAEPLP SYRRLPSATE TALAGGWVFE
RDAMGWQAFN VAGERLDGGW RFRPGIDPAL LSPLLNLDAR EVTAIEIRMA NGTRARDAQL
FYAGIDGALT EEYSVRWQLA ETAEAVTYVI DLREASGWHG IITRLRFDPV GMGDDGEIRV
EWVRLRLATP SGKQALASVG RLSSTLSICA AHAA