Gene Rcas_0340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0340 
Symbol 
ID5537802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp423169 
End bp424647 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content60% 
IMG OID640892503 
Producthypothetical protein 
Protein accessionYP_001430490 
Protein GI156740361 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00130161 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTATC TTTATCTCCG TCTTGCCCTG ATCGTGCTGT TGCTCGCGGC AGGCGCGAGC 
AGCGCGGCAG CGGCGCCCGC CGCCGAGGAA GGAAGTACGA CCGCCGTTTT CTTTCGGGAA
ACGCAGAACT GGATTGCACC GCCGTTCTAT CAGTTCTGGA GCAAAAACGG CGGGTTGCCG
ATCTTTGGCT ACCCGCTGGC GCCGACCGAT TTGATGACTA GCCCGGACGA CGGGAATACC
TATGCCACCC AAATCTTTGA GCGCAACCGG CTTGAGTACC ATCCTGAAAA TCCTGAGCCA
TACCGGGTGC AACTTGGCCG CCTGGGCGTT GAGGTGCTGC AAAAGCAGGG GCGCAACTGG
CAGGATTTTC CCAAGGGTAC GCCAAAGCGC GGATGCGACT TCTTTCCTGA AACCGGGCAT
ACCCTCTGTG AGCCGTTCCG CAGTTACTGG CGACGCTATG GGTTAGACCT GGGCTTTCGC
GGCGTGACGC GCGCCGAGTC AATCGCCCTG TTCGGTCTGC CGATCTCTGA GCCGATGATG
GAGACTAACG CCGATGGTGC GACGGTGCTG ACGCAGTGGT TCGAACGCGC ACGCTTCGAA
TATCATCCCA ATAACCCGCC GCAGTATCGC GTCTTGCTCG GACTCCTCGG CAAGGAACTC
TATGGACCCT ATCGTCCCGG CGTGGCAAGC GTGAGCGGCA GGGTCGTGGA GCGCGGGATG
CCGTTCTGGG ATGCGGTCAG CAATAGCGGT ATGCAGATCA AGCAGGCGTC TGTCGGAATT
GAGGGACCGT ACTATCCCGA AGATCATCCG CTCAGCGGCG TGACGGAAGT GGTCATGACC
TCGCAAGGGG AGATTTCCGT TTCGACCGAA GGATGGTTTA CAACGCTCGT CGATATTCAA
CCCGATAACT CGTTCGCATT CAGCGACTTT ATTCCCACTT TGCCGGGCGC TTTCATGATG
CTGGGGCCGG GAAGGATCGA TGGCGTCTGC GCTGACGGAA AGGAAGCCGA TTTCTTGTCG
TATGGGTCAG TACTGTTTGG GTCAGGCCAG GATTTGGATT TTGCTCTGGA AGGCGGTCAG
GCAGCAAATT TTGGTGACTT TCGGGTGAAT GTCATCTGTC TCTCCCCGCC CGAGCCCGAC
GCGCGCTATC ACGACGCGGC GTTCGCCGCC GTGCAGTGCG CCGCCGGACG TCCCTTGACG
CGCGACCCGG AGTTTGACGC CTTCGCCGTC GAAGCCCAGC GCGCACATAG AAATGAGCGC
GTCGAGGCGA TGGTGCGCGA ACACCCTCGA ATACGCGGCA TCACTTTTCT TGGCGTCACG
TTCGACGGAG CGCCGTCGTC CGACCCGTGC ATCTTCGGCG GGAAGAACTT TCGCGACATT
CGCCTCCTGT TTGAACAGGC GACGACGATC GGCGTCGCGG TCTTCCCGTC GTCATCCGCC
AACTATCCCG TCGGCACGCT GGTCATCGTT CGAGACTGA
 
Protein sequence
MHYLYLRLAL IVLLLAAGAS SAAAAPAAEE GSTTAVFFRE TQNWIAPPFY QFWSKNGGLP 
IFGYPLAPTD LMTSPDDGNT YATQIFERNR LEYHPENPEP YRVQLGRLGV EVLQKQGRNW
QDFPKGTPKR GCDFFPETGH TLCEPFRSYW RRYGLDLGFR GVTRAESIAL FGLPISEPMM
ETNADGATVL TQWFERARFE YHPNNPPQYR VLLGLLGKEL YGPYRPGVAS VSGRVVERGM
PFWDAVSNSG MQIKQASVGI EGPYYPEDHP LSGVTEVVMT SQGEISVSTE GWFTTLVDIQ
PDNSFAFSDF IPTLPGAFMM LGPGRIDGVC ADGKEADFLS YGSVLFGSGQ DLDFALEGGQ
AANFGDFRVN VICLSPPEPD ARYHDAAFAA VQCAAGRPLT RDPEFDAFAV EAQRAHRNER
VEAMVREHPR IRGITFLGVT FDGAPSSDPC IFGGKNFRDI RLLFEQATTI GVAVFPSSSA
NYPVGTLVIV RD