Gene Information |
Locus tag | Rcas_2022 |
Symbol | |
ID | 5539500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2593844 |
End bp | 2595034 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894157 |
Product | amidohydrolase |
Protein accession | YP_001432128 |
Protein GI | 156741999 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
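The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship using the values from this record. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2593844
end_bp = 2595034

# Assumption: Start bp and End bp are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1191 396
```

Both results agree with the Gene Length and Protein Length fields.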
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCGACA AAGCCAACGC TATCGCATCT GAAATCGTTC GCCTGCGCCG CGACATTCAT GCGCACCCTG AACTTGCGTT CCAGGAAGTG CGGACTGCCC AACTGGTTGC GGAGACGCTG CGCGAGATCG GCGGCATCGA CATCCGCACT GGCGTCGGCA AAACTGGCGT CGTCGGGCAT TTGGGAACCG GAGATGGACC GACGATCGGC ATCCGTGCCG ATATGGACGC GCTGCCAATC GACGAAGCGA CCGGGTTGCC GTTTGCCTCG CAGAATCCTG GTGTGATGCA CGCCTGCGGG CATGATGCGC ATACCGCCAT CCTGCTCGGC GTTGCGCACC TGCTCAAGCA GGAGTTCGCC GCCGGCAATC TGCGCGGCAA TGTACGCTTT CTGTTCCAAC CTGCCGAGGA AGCGCAAGAT GCAGAGGGTC TCAGTGGCGC GCCCCGGATG ATCAACGATG GCGCGCTCGA TGGCGTCGAT CACGTTATCG CGTTACACGT CGACTCCGGG TTGCCGGTGG GAAAAATCAC CATCCGTGAG GGGGCGAGTT CAGCGGCGGT TGATAGTTTT CGTGGTTGGA TCACGGGGAG CGGCGGACAT GGCGCCTATC CGCATCTGGG AACGGACCCG CTCTGGATGC TGTTGCCGGT GATGCAGGCG CTGCACGGAA TCGTTGCGCG TCGTGTCAAC CCGATGCACC CGGCAGTCGT GAGCCTTGGC GTTGTGCGCG GCGGCACGGC GTCGAACGTT ATTCCCGCTG AGGTGTATCT GGAGGGCACA CTGCGCAGTT TCGATCCGCA GGTGCGTGAG CAGTTGCTTG TCGAGGTGGA GCGCGCCTTT GCCGTCGCGC GCGCCGTTGG CGGCGATTAT CGGCTGGAAA TCGAGCGCGG CTATCCCGCC GGACACAATG ATGCCACAGT GAGCGACTGG ATTTCGGCAA CTGTTACCGA TCTGATCGGC GCTGATGCGA TTGATCGCAG TCGCACCGGT ATGGGTGCGG AGGATTTCGC TTATATGACG CAGAAAGCGC CTGGTGCGAT GTTCATGCTT GGCGCGGCCA TCGATGATGG TGTGAGCCGT GGGCATCATA CGCCGATCTT CGACATCGAC GAGCGCGCGT TGCCGATCGG CGCGGCTATT CTTGCCGAAA CCGCGCGCCG CTATCTGGCC GGTAATACCA ATGACCTGTG A
Protein sequence | MLDKANAIAS EIVRLRRDIH AHPELAFQEV RTAQLVAETL REIGGIDIRT GVGKTGVVGH LGTGDGPTIG IRADMDALPI DEATGLPFAS QNPGVMHACG HDAHTAILLG VAHLLKQEFA AGNLRGNVRF LFQPAEEAQD AEGLSGAPRM INDGALDGVD HVIALHVDSG LPVGKITIRE GASSAAVDSF RGWITGSGGH GAYPHLGTDP LWMLLPVMQA LHGIVARRVN PMHPAVVSLG VVRGGTASNV IPAEVYLEGT LRSFDPQVRE QLLVEVERAF AVARAVGGDY RLEIERGYPA GHNDATVSDW ISATVTDLIG ADAIDRSRTG MGAEDFAYMT QKAPGAMFML GAAIDDGVSR GHHTPIFDID ERALPIGAAI LAETARRYLA GNTNDL
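The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the pipeline that produced this record. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `translate` and `gc_fraction` are hypothetical. Only short fragments of the gene sequence are used here, and the full sequence can be passed in the same way.

```python
from itertools import product

# Codon table built in TCAG order (third base varies fastest).
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence in this record.
gene_head = "ATGCTCGACA AAGCCAACGC TATCGCATCT"
gene_tail = "GGTAATACCA ATGACCTGTG A"

print(translate(gene_head))  # prints: MLDKANAIAS
print(translate(gene_tail))  # prints: GNTNDL
```

The two fragments translate to the first ten and last six residues of the protein sequence above. Calling `gc_fraction` on the full gene sequence gives a value to compare against the GC content field.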