Gene Information |
Locus tag | Rcas_1229 |
Symbol | |
ID | 5538698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1587892 |
End bp | 1588788 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640893364 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001431344 |
Protein GI | 156741215 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
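The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1587892
END_BP = 1588788

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 897 298
```

Under these assumptions the result agrees with the reported Gene Length (897 bp) and Protein Length (298 aa).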
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0581563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATGC ATGGGTTTTA TCATACTTCT GGCATCGCTG CCGGCTTTGC CGATCTTGCC GAACGCTCTC GGCGGAGCGT CGTGCATCTG CTCAGCGCAG GCAGGGGCAG CGGAACAGGC GTGGTCTGGC AAATTGGCGG CGTTGTCCTC ACCAACGATC ACGTGGTTGC CGGAGCAGGC GGTATGCTGC GCGCGCAAAC ACTCGACGGG CGCGACCTGC CGGCGACGGT GATGGCACGC AGCCAGGACC TCGATCTGGC GTTGTTGCGC ATTCCAGTCG ATGATCTCTT GCCTATCACG GTCGGCGACT CGACGCGACT GCGGGTTGGT GAACTCGTCT TCGCAATTGG GCATCCATGG GGGCAACCGT GGGTCGTGAC CGCCGGCATC GTCAGCGGGT TGGGCGAAGC GGAGGCACGC AACGGTCAGC CGATAGCCTT CATCCGTTCC GACGTGCGGC TGGCGCCGGG CAACTCGGGG GGACCACTCC TCGATGCGCA CGGTCAGGTC ATTGGGATCA ACGCGATGGT TTTCGGCGGC GATCTCTCGG TAGCGATTGC CAGCCACGTC GTCGAGTCAT GGCTCAACGG CGCGCAGGGA CGGCGGGTGC GCCTCGGCGT CGGGGTGCAG CCATCGCCGC TACCGACGGG ATTGTTGAAC GGGCGTGCGC ATGGGCTGTT AGTCATCAGT ATCGAGCCTG GCAGTCCAGC AGAACAGGCA GGGTTGATGG TCGGCGACTT GCTGCTCCAC GCCGATGCGG TATCACTGGA GCGTCCGGAA GACCTGCACG CGGCGCTCCG GCGCACAGCG GAAGCAACAG TACGCCTCCG CCTGCTGCGC GCCGGCGCTA TCCGTGTCAT CGACGCGCCC CTGGACAGGG CAGTACAGGA ACCATGA
|
Protein sequence | MMMHGFYHTS GIAAGFADLA ERSRRSVVHL LSAGRGSGTG VVWQIGGVVL TNDHVVAGAG GMLRAQTLDG RDLPATVMAR SQDLDLALLR IPVDDLLPIT VGDSTRLRVG ELVFAIGHPW GQPWVVTAGI VSGLGEAEAR NGQPIAFIRS DVRLAPGNSG GPLLDAHGQV IGINAMVFGG DLSVAIASHV VESWLNGAQG RRVRLGVGVQ PSPLPTGLLN GRAHGLLVIS IEPGSPAEQA GLMVGDLLLH ADAVSLERPE DLHAALRRTA EATVRLRLLR AGAIRVIDAP LDRAVQEP
|
| |
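The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the pipeline the database itself uses. It assumes the gene sequence is displayed in coding orientation (it starts with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the usual TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 1 up to the first stop codon.

    Simplification: non-ATG start codons (allowed in table 11) are not
    converted to M here; this record starts with ATG, so it does not matter.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGATGATGC ATGGGTTTTA TCATACTTCT"
print(translate(prefix))  # prints: MMMHGFYHTS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.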