Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3759 |
Symbol | |
ID | 5541261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4929082 |
End bp | 4930017 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895869 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001433816 |
Protein GI | 156743687 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.151954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0225337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTACC ACATTCGACA TCTGACCCGT TTCCGCTATA GCGCGCCGGT GAGCGAGAGC GTCGTGGAAG TGCGCATGCA ACCGCGCAGT GATGGGCATC AGCGCCTTCA CAGCTTTCAG ATGACCACCA TCCCGCGGAC GACCATCTTC AGTTACCGCG ACATTCTGGG GAATATGGTA CATCACTTCG ATGTTCCCGG ACGCCATAAA TTGCTTACGG TGACATCCGA GGCGCTGGTA GAGGTCAGCG AGCCGTCGCC GATCACTGCG CTCGATGGCG AATCCTGGCG CGCGCTCGAT GCGCTTGCCG CCAGCGGCGA GTATTGGGAC CTGCTCCAAC CCGGTCGTTT TACGCACCCC AGCGATCTAT TGCGCGCCTT CGCCGCAGAG ATAGCGTTAC AGCGCGGCAG CGATCCGCTG ACGACCCTTT GCCGGCTCAA CACGCGCATC TACGATGCCT TTGAATATGC TCCGGGCAGC ACCCATGTGC ATTCGCCGAT TGACGACGCT CTGCGCATGC GTCGTGGCGT CTGCCAGGAC TTCGCCCACA TTATGATCAC GCTGACGCGT GCGCTCGGCA TCCCCTGTCG CTATGTCAGC GGCTACCTCT TCCATCGCGC CGAAGACCAT GATCGCTCAG AAGCGGACGC GACCCACGCC TGGGTGGAGG CCCTCCTGCC GGGTATGGGG TGGGTCGGAT TTGATCCGAC GAACAACCTG ATTGCCGGCG CGCGCCACAT TCGTGTGGCC ATTGGGCGCG ACTATGCTGA CGTTCCGCCA TCGCGCGGCG TCTACAAGGG TCAGGCGACA AGTGAACTCG ATGTTGCCGT GCGCGTTGCA CTGGTTGCGA CGCCGTCCGC AGCGGCTGAT GAGCCGCCAC CCGAATGGCA GGCGATGGAG CGCGCGTTTA TGGAAGAGGC GCACATGCAG CAGTGA
|
Protein sequence | MLYHIRHLTR FRYSAPVSES VVEVRMQPRS DGHQRLHSFQ MTTIPRTTIF SYRDILGNMV HHFDVPGRHK LLTVTSEALV EVSEPSPITA LDGESWRALD ALAASGEYWD LLQPGRFTHP SDLLRAFAAE IALQRGSDPL TTLCRLNTRI YDAFEYAPGS THVHSPIDDA LRMRRGVCQD FAHIMITLTR ALGIPCRYVS GYLFHRAEDH DRSEADATHA WVEALLPGMG WVGFDPTNNL IAGARHIRVA IGRDYADVPP SRGVYKGQAT SELDVAVRVA LVATPSAAAD EPPPEWQAME RAFMEEAHMQ Q
|
| |