Gene Rcas_0236 details


Gene Information

Locus tag: Rcas_0236
Symbol:
ID: 5537698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 292656
End bp: 293939
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 61%
IMG OID: 640892400
Product: glycoside hydrolase family protein
Protein accession: YP_001430387
Protein GI: 156740258
COG category: [R] General function prediction only
COG ID: [COG3858] Predicted glycosyl hydrolase
TIGRFAM ID:
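
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 292656
end_bp = 293939
gene_length_bp = 1284
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should contain a whole number of codons.
codons = span // 3

# Assumption: the final codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(span, codons, residues)
```

Under these assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.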


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.946452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACACG TTTTTTCGTC ACCGCTGGCT GCATCGCTCT ATGCTGTGGC GCTGATCGTC 
TGGGCATTGC TGGTCGTCGA TACGGTGAGT CTTGCCCGGC GCGCCACTGA GCCGCCGATC
TTTCCGACGC CAACGACCGT CGCAGTGGTG ACGCCCGTCG CGGCAACGCG GGTAGCGGCG
CCCACATCGA TGCCGCCGGT GCAGGCAACT GTCGTTCCCC TCATGACGCC GGAACTTGAC
GCAGAAGATG GCTGGTTTCA TCCGAAGACC GGACGTTATA TTGCTGCCTG GCTCCCCAAC
TCGTTCGGCT CCGAAAATCG TGAGTCGTTT GAGGCGAATG CCGATATTCT CGATGAGATC
AGCCCGTTCT GGTACTCGCC GTCGCCTGGC GGCGAGCTGC GGTTTGGGCG CGAGGCGCGC
GACCGCACGC TGCTCGAACT GGCGCATGGC AAAAACGTCC TCGTCATTCC AACCGTCCAC
AACGTCGTTA CCGGCGAAGA CCCGGTGCCG GGCATTCTGC GCAATCCGCG CCTGCGGTCG
TACCACGTGC AGCAGATCGT CGATGAGGTG TTGACCTATG GCTACGACGG GATCGATATC
GACTATGAGT TTCTCAGCAG CAGTCTGCGC GATGACTATA GCGCCTTTAT TCTGGAACTG
GCGGATGCGC TGCATGCGCA TGGCAAACTG CTGACGGTCG CTGTTCATGC GAAAGATTGT
GATTATTGCG GACTGGGAGG TTTTCAGGAT TGGGCTGTGA TCGGTCAGGT GGTTGACCGG
TTGCGCATTA TGACCTACGA TTACCACTGG CGCGGCGGCG GTCCTGGTCC GGTGGCGCCG
GTCTATTGGG TCGAACGGGT GGCGCGCTAC GCGGTGACGG TCGTTGATCC GGCGAAGGTG
GTGATCGGCG TGCCGTTCTA CGGCTACAAC TGGTCCCGTG ACGGCAGCGG CAATGCGCGC
GGGCAGACGT GGGCGATGAT TAATGAGATC ATTCAAACCT ACCGCCTGTC GGTCAATCTC
ATGGAGAGCA ATCAGAATGG TCTGGTGCAG GAAAACTGGA TCACCTACAG TTCGCGCACC
GAAGGACGGC GTGAGGTCTG GTTCGCTACG AGCAGCGGTC TCGACGCAAA ACTGCGCCTG
GTGCAGGAAC TCGATCTGGC GGGGATTGCG ATCTGGCGGC TCGGCGGCGA AGACCCGCGC
AACTGGGAGA TCATTCGCGC GCGCCTGCTC CAGGACCCGT ATGAGTCGCA GCGGGTGTTG
AGTCGCCTTT TGCCGGAGCA TTGA
 
Protein sequence
MRHVFSSPLA ASLYAVALIV WALLVVDTVS LARRATEPPI FPTPTTVAVV TPVAATRVAA 
PTSMPPVQAT VVPLMTPELD AEDGWFHPKT GRYIAAWLPN SFGSENRESF EANADILDEI
SPFWYSPSPG GELRFGREAR DRTLLELAHG KNVLVIPTVH NVVTGEDPVP GILRNPRLRS
YHVQQIVDEV LTYGYDGIDI DYEFLSSSLR DDYSAFILEL ADALHAHGKL LTVAVHAKDC
DYCGLGGFQD WAVIGQVVDR LRIMTYDYHW RGGGPGPVAP VYWVERVARY AVTVVDPAKV
VIGVPFYGYN WSRDGSGNAR GQTWAMINEI IQTYRLSVNL MESNQNGLVQ ENWITYSSRT
EGRREVWFAT SSGLDAKLRL VQELDLAGIA IWRLGGEDPR NWEIIRARLL QDPYESQRVL
SRLLPEH
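
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the alternative start codons it permits; this gene starts with ATG, so the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool, and only the first 60 and last 24 bases of the gene sequence are used here.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line and last line of the gene sequence listed in this record.
first_60 = "ATGCGACACG TTTTTTCGTC ACCGCTGGCT GCATCGCTCT ATGCTGTGGC GCTGATCGTC"
last_24 = "AGTCGCCTTT TGCCGGAGCA TTGA"

print(translate(first_60))
print(translate(last_24))
```

The two fragments translate to the first 20 and last 7 residues of the protein sequence listed above. The last 24 bases are in frame because the preceding 1260 bases are a multiple of three.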