Gene Information |
Locus tag | Rcas_3505 |
Symbol | |
ID | 5541004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4572914 |
End bp | 4574011 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640895623 |
Product | Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase |
Protein accession | YP_001433573 |
Protein GI | 156743444 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
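The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes inclusive, 1-based coordinates and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values listed but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4572914
END_BP = 4574011
REPORTED_GENE_LENGTH = 1098      # bp
REPORTED_PROTEIN_LENGTH = 365    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 4574011 - 4572914 + 1 gives 1098 bp, and 1098 / 3 - 1 gives 365 aa, matching the reported lengths.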
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGCAC ACTTCATATC GTTCACGCTC TCGTATGGTT ATGACTCATT GCAAGAAGAG CATCGCATGA GCCAGTTTGA TTCGTATCGC GCGCTGGCGC TGCAAGTTAC CTGCCATGCT GTTAACGCCC TGAACAATCG GGTGGCTGTC CGTGAGCAGA TGTTGGCGAC TATTGTGCGG CTGCGTGAAC AGATACGCGC CAGTATCGCC TTTATCGGCA ATGATGTGCG GCTCGTGGTT TTGCCCGAAT ATTTCCTGAC CGGCTTTCCG TTGGGTGAAA GCATCGCTGT ATGGTCGGAA AAAGCCGCGA TTGATCCGGA TGGTCCCGAA TATGCTGCAC TTGGTCAGAT TGCGTGTGAT CTGCAAATCT TTCTTGCCGG TAATTGCTAT GAACGTGATC CGCACTTTCC CGGTCTCTAT TTTCAGGTGA GCTTTGTCAT TGACCCTTCC GGTCAGTGTG TGCTGCGCTA CCGACGCTTA AACTCGATGT TTGCGCCGAC GCCGCACGAT GTGTGGGATC GATTTTGCCA AATGTACGGT CCTGATGCGC TCTTCCCGGT CGCCGACACT GCAATCGGGC GGTTGGCCTG CATCGCCTCA GAAGAGATTC TCTTCCCCGA AGTTGCGCGT TGTCTTGCGA TGCGTGGCGC CGAAGTATTT CTCCATTCAT CTTCGGAAGT GAGCAGCCCG GAATTAACCC CGAAGCACAT CGCCAAACGA GCGCGCGCGT TAGAAAATCT GGCCTATGTC ATTTCGGCGA ACTCTGCCGG TATTAGTGGC ATTCCAATCC CTGCTGCCTC GGTTGAGGGC GGTTCGCAGA TTGTTGATTA CACAGGGCGC GTGTTAGTCG AAGCCGGCCA GGGCGAGAGC ATGGCGGCCC ACGCTGAGAT CGATCTTGCC GCGCTCCGGC GCTACCGTCG TCGCCCTGGC ATGAACAACC TGCTGAGTCG TCAACGGTTT GATCTGTATG CCAGCAGTTA TGCGACTGCC GGCTTTTACC CGCCGAATAC GTTGTTGACC GGTGTTGCGG AACGGCAACA TTTTCTGCGT GTGCAACAAG AGACGATTGA GCGCCTGGCG CAGAAGGGGA TAATTTGA
|
Protein sequence | MAAHFISFTL SYGYDSLQEE HRMSQFDSYR ALALQVTCHA VNALNNRVAV REQMLATIVR LREQIRASIA FIGNDVRLVV LPEYFLTGFP LGESIAVWSE KAAIDPDGPE YAALGQIACD LQIFLAGNCY ERDPHFPGLY FQVSFVIDPS GQCVLRYRRL NSMFAPTPHD VWDRFCQMYG PDALFPVADT AIGRLACIAS EEILFPEVAR CLAMRGAEVF LHSSSEVSSP ELTPKHIAKR ARALENLAYV ISANSAGISG IPIPAASVEG GSQIVDYTGR VLVEAGQGES MAAHAEIDLA ALRRYRRRPG MNNLLSRQRF DLYASSYATA GFYPPNTLLT GVAERQHFLR VQQETIERLA QKGII
|
| |
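The gene and protein sequences above can be cross-checked with a short translation routine. The sketch below is a minimal, standard-library illustration, not the tool that produced this record. It assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA, even though the strand is listed as "-"), uses the standard amino acid assignments shared by translation table 11, and treats only a common subset of the table 11 start codons (ATG, GTG, TTG) as initiators that are read as methionine. The helper `gc_fraction` is likewise a hypothetical name for the usual (G + C) / length calculation.

```python
# Codon table built in the conventional TCAG order; translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only a common subset of table 11 start codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 21 bases of the gene sequence above.
    print(translate_cds("GTGGCGGCAC ACTTCATATC G"))
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing the spaces) is one way to confirm that the two fields agree.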