Gene Information |
Locus tag | Rcas_0232 |
Symbol | |
ID | 5537694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 288729 |
End bp | 289787 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640892396 |
Product | peptidase M42 family protein |
Protein accession | YP_001430383 |
Protein GI | 156740254 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
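The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the usual convention for records of this kind, not stated here) and a CDS that ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp taken from the record above.
length_bp = gene_length(288729, 289787)
print(length_bp, protein_length(length_bp))  # prints: 1059 352
```

Both values agree with the Gene Length and Protein Length rows.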
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACAACC ATTCGCTTGC CTTTTTGAAG CAACTGCTCG CCACTCCTGG TCCTTCCGGT GAAGAAGTCG CCGCCGGGCG CGTCTGGCGA CGCGAAGCCG AAACCTTTGC TGACCGGGTC TATGCCGATG TGCGCGGGAG TTCGTATGCT GTGCTCGAAG GGGGCGCGCC GCGCGTGCTG CTCGCCGGAC ATATCGACGA GATTGGGGTG ATGGTCAGTT ATATCGACGA CGATGGCTTC CTCTGGTTTT CGCCGATTGG CGGGTGGGAC CCGCAGGTGC TCGTCGGGCA GCGGGTGCGG TTGCTGGGGC GCGCCGGCGA TGTAATCGGC GTGATCGGGA AGAAACCCAT CCACCAGATG AAATCCGAGG AACGGGAAAA AGCCAGCAAG ATTGAAGACC TCTGGATCGA TATTGGTGCG GCGAACCGGG CAGAAGCCGA GGCGCTCGTG CGTGTCGGCG CTGCTGGAGT GATCGATGCG CCGATCTACG ATCTGCCGGG TGGAAAGGTT GTCTCACGCA GCATCGACGA CCGGATTGGC GCGTTCACCG TGCTGGAAGC GCTGCGCCTG CTGGCGCGCG ACCGCCCGCG CGCGACGGTG GCAGCAGTGG CGACATCGCA AGAGGAGATC ACCTTTGCGG GAGCGCGCAC CGCAGCGTTC AGTTTCGAAC CGCAGGTGGC GATTGCGGTG GATGTGACGT TTGCCACCGA TCACCCCAAT GCGGATCGGA AGCAGTATGG CAACGTGCGG TTGGGTGGCG GACCGGTGCT GTCGCGCGGT TCTGCCAACA GCCCGGTGGT GTACGATATG CTCGTGGCGG TCGCCGAGCG CGAGGGCATT CCGTACAGCG TGCAGATCAA CCCGCGCTAC ACCGGCACCG ACGCCGATGC CATTCACATC GCGCGTGGCG GTGTCGCTAC CGGCGTTGTG TCGATCCCGA ACCGCTACAT GCACTCACCC AACGAAATGA TCGCGCTGAG CGACGTTGAA CATGCCGCGC GCCTGATCGC TGCGTTTGTG CGCAGTCTGG GACCGGAGAC TGATTTCATT CCACGCTAA
Protein sequence | MNNHSLAFLK QLLATPGPSG EEVAAGRVWR REAETFADRV YADVRGSSYA VLEGGAPRVL LAGHIDEIGV MVSYIDDDGF LWFSPIGGWD PQVLVGQRVR LLGRAGDVIG VIGKKPIHQM KSEEREKASK IEDLWIDIGA ANRAEAEALV RVGAAGVIDA PIYDLPGGKV VSRSIDDRIG AFTVLEALRL LARDRPRATV AAVATSQEEI TFAGARTAAF SFEPQVAIAV DVTFATDHPN ADRKQYGNVR LGGGPVLSRG SANSPVVYDM LVAVAEREGI PYSVQINPRY TGTDADAIHI ARGGVATGVV SIPNRYMHSP NEMIALSDVE HAARLIAAFV RSLGPETDFI PR
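The relationship between the gene sequence and the protein sequence can be sketched in a few lines of plain Python. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses these
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGAACAACC ATTCGCTTGC"))  # prints: MNNHSL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared against the GC content row.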