Gene Rcas_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0232 
Symbol 
ID5537694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp288729 
End bp289787 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID640892396 
Productpeptidase M42 family protein 
Protein accessionYP_001430383 
Protein GI156740254 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACC ATTCGCTTGC CTTTTTGAAG CAACTGCTCG CCACTCCTGG TCCTTCCGGT 
GAAGAAGTCG CCGCCGGGCG CGTCTGGCGA CGCGAAGCCG AAACCTTTGC TGACCGGGTC
TATGCCGATG TGCGCGGGAG TTCGTATGCT GTGCTCGAAG GGGGCGCGCC GCGCGTGCTG
CTCGCCGGAC ATATCGACGA GATTGGGGTG ATGGTCAGTT ATATCGACGA CGATGGCTTC
CTCTGGTTTT CGCCGATTGG CGGGTGGGAC CCGCAGGTGC TCGTCGGGCA GCGGGTGCGG
TTGCTGGGGC GCGCCGGCGA TGTAATCGGC GTGATCGGGA AGAAACCCAT CCACCAGATG
AAATCCGAGG AACGGGAAAA AGCCAGCAAG ATTGAAGACC TCTGGATCGA TATTGGTGCG
GCGAACCGGG CAGAAGCCGA GGCGCTCGTG CGTGTCGGCG CTGCTGGAGT GATCGATGCG
CCGATCTACG ATCTGCCGGG TGGAAAGGTT GTCTCACGCA GCATCGACGA CCGGATTGGC
GCGTTCACCG TGCTGGAAGC GCTGCGCCTG CTGGCGCGCG ACCGCCCGCG CGCGACGGTG
GCAGCAGTGG CGACATCGCA AGAGGAGATC ACCTTTGCGG GAGCGCGCAC CGCAGCGTTC
AGTTTCGAAC CGCAGGTGGC GATTGCGGTG GATGTGACGT TTGCCACCGA TCACCCCAAT
GCGGATCGGA AGCAGTATGG CAACGTGCGG TTGGGTGGCG GACCGGTGCT GTCGCGCGGT
TCTGCCAACA GCCCGGTGGT GTACGATATG CTCGTGGCGG TCGCCGAGCG CGAGGGCATT
CCGTACAGCG TGCAGATCAA CCCGCGCTAC ACCGGCACCG ACGCCGATGC CATTCACATC
GCGCGTGGCG GTGTCGCTAC CGGCGTTGTG TCGATCCCGA ACCGCTACAT GCACTCACCC
AACGAAATGA TCGCGCTGAG CGACGTTGAA CATGCCGCGC GCCTGATCGC TGCGTTTGTG
CGCAGTCTGG GACCGGAGAC TGATTTCATT CCACGCTAA
 
Protein sequence
MNNHSLAFLK QLLATPGPSG EEVAAGRVWR REAETFADRV YADVRGSSYA VLEGGAPRVL 
LAGHIDEIGV MVSYIDDDGF LWFSPIGGWD PQVLVGQRVR LLGRAGDVIG VIGKKPIHQM
KSEEREKASK IEDLWIDIGA ANRAEAEALV RVGAAGVIDA PIYDLPGGKV VSRSIDDRIG
AFTVLEALRL LARDRPRATV AAVATSQEEI TFAGARTAAF SFEPQVAIAV DVTFATDHPN
ADRKQYGNVR LGGGPVLSRG SANSPVVYDM LVAVAEREGI PYSVQINPRY TGTDADAIHI
ARGGVATGVV SIPNRYMHSP NEMIALSDVE HAARLIAAFV RSLGPETDFI PR