Gene Rcas_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3535 
Symbol 
ID5541034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4607786 
End bp4609576 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content61% 
IMG OID640895652 
Producthypothetical protein 
Protein accessionYP_001433602 
Protein GI156743473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCC GTAAATATTG CTCATTGTTG CTCCTGATCG TGCTGGCAGG CGCGCTGCTG 
TTGCCATTGC GCACTCAGGC GGCGGCGCGC AGCCCGGAAG AAGTGCCGCC GACGCAACCG
CCGTTTCGGG CGCGCCTCTT CCCGGAAACC GGGCACACGG CGGTGAATTC GTTCCTCTCC
TTCTGGGAAC GGACGCCGAA TGCTCTGTTT GTGCTCGGCT ATCCGATTTC GGCGCCATTC
ATCGAGGAAA GCTTTACGAA TCCCGGTGAG TATTACCGTG TGCAGTATTT TGAACGCGCC
ATTCTCGAAG AGCACCCGGA AAACTACGGG ACCCCCTACT ATATCCTTGG GCGGTTGCTG
GGCACGGAAA TCGTGAAGGG CCGCGAGAGC GAAGCGCCGT TCCAGCCGGT TCCCGACCCG
CGCGACGGCA CGTGGGACGA GTTCACCCGC CATACCCTGC GCAATTCGCC GGCGCCATTC
CGCAGTTTCT GGCTCAACAA CGGCGGCCTG GCGGTCTTTG GACGTCCCCT TTCCGAACAG
TTTCAGGAAG TGAACCAGGC GGACGGCAAT GTCTACTGGG TGCAGTACTT CGAGCGGCAG
CGCATGGAAT GGCATCCCGA CGAGCCTGAT CCGCGCTACC GCATTCTGCT TGGTCTCCTC
GGCAATGAAT ATCGTGATGC GCATCATCAA GGCGCCCCGG CATTCATCCC CGGCGCAACT
GCGCCTGATC AACCGCAACC GCCGCCTGCG AGCGATTTTG CCTACGGGTA TAACGTGATT
CTGTACGGTC AGGGGTCAAC CTCGTGGCAG GATCGACCGC GTGTGCTGCG TTTGGTGAAA
GAGAGCGGCT TTGGTTGGGT GCGCCAGCAG GTGCGCTGGA TGGATCTGCA CGACCGTTCT
GGCGCGATCT ACTGGGGTGA ACTCGATACT ATCGTCGAAG ATTGCCATCG GGAGGGGGTG
AAGGTCTTGT TGAGCGTCGT CGCGGCGCCT TCATGGGCGA CGCCCAACGG CAGGAATGGT
CTGCCGTCGC GCGAGCATTT CGGCACCTTC GCTTCTTTCA TGGGCGAAAT GGCAAAACGT
TACCGCGGTA AGGTTCAGGC GTACGAAATC TGGAACGAGC AGAACCTGGC GGTCGAGAAT
GGCGGGCGTG TGCCGAATGC GTCTTTCTAC ATGGACATGC TGGTGCAGGC GTCGCAGGCG
ATCAAGGCGA ATGACCCGGC GGCGCTGATC GTTTCTGGCG CGCCGTCGAG CACCGAGACG
AACGCACCGA CTATCGCCGT CAGCGACCTG GTCTTTTTGC AGCAGATGTT CGCCGACTCG
CGCTTCCGCG CGAATGTCGA TATTGTTGGC GTCCATCCCG GCGGCGCCGC CAACCCGCCC
GATACTTTCT GGCCCGACAA TCCGGGACCG GGACCGGGAT GGACGAATAG CCGCGAGTTC
TACTTCCGCC GCGTTGAGGA TGTGCGTGCT CTGATGGTGC GTTCCGGGTT GGGGGATATG
CCGATGTGGG TGACGGAATT TGGATGGGCG ACGCGCAACA ATACGCCAGG GTATGGCTTT
GGCAATCAGA TCTCGTTTGA AAAACAGGCG CAGTATATTG TGCGCGCATA CGAGATGGCG
CGCACGAACT ACTCTCCCTG GATGACCGGC ATGTTCCTGT GGCAGCTCAA CTTCGCCGTC
CCATGGCGCG CTCAGGGCAA TGAGTTGCAC GAACAGGCAA GCTACGGCGT GATCAATGGC
GACTGGAGTC CACGCCCGGC TTATCTGGCG CTCAAGGCGA TGCCAAAGTA G
 
Protein sequence
MPARKYCSLL LLIVLAGALL LPLRTQAAAR SPEEVPPTQP PFRARLFPET GHTAVNSFLS 
FWERTPNALF VLGYPISAPF IEESFTNPGE YYRVQYFERA ILEEHPENYG TPYYILGRLL
GTEIVKGRES EAPFQPVPDP RDGTWDEFTR HTLRNSPAPF RSFWLNNGGL AVFGRPLSEQ
FQEVNQADGN VYWVQYFERQ RMEWHPDEPD PRYRILLGLL GNEYRDAHHQ GAPAFIPGAT
APDQPQPPPA SDFAYGYNVI LYGQGSTSWQ DRPRVLRLVK ESGFGWVRQQ VRWMDLHDRS
GAIYWGELDT IVEDCHREGV KVLLSVVAAP SWATPNGRNG LPSREHFGTF ASFMGEMAKR
YRGKVQAYEI WNEQNLAVEN GGRVPNASFY MDMLVQASQA IKANDPAALI VSGAPSSTET
NAPTIAVSDL VFLQQMFADS RFRANVDIVG VHPGGAANPP DTFWPDNPGP GPGWTNSREF
YFRRVEDVRA LMVRSGLGDM PMWVTEFGWA TRNNTPGYGF GNQISFEKQA QYIVRAYEMA
RTNYSPWMTG MFLWQLNFAV PWRAQGNELH EQASYGVING DWSPRPAYLA LKAMPK