Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0236 |
Symbol | |
ID | 5537698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 292656 |
End bp | 293939 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892400 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001430387 |
Protein GI | 156740258 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.946452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACACG TTTTTTCGTC ACCGCTGGCT GCATCGCTCT ATGCTGTGGC GCTGATCGTC TGGGCATTGC TGGTCGTCGA TACGGTGAGT CTTGCCCGGC GCGCCACTGA GCCGCCGATC TTTCCGACGC CAACGACCGT CGCAGTGGTG ACGCCCGTCG CGGCAACGCG GGTAGCGGCG CCCACATCGA TGCCGCCGGT GCAGGCAACT GTCGTTCCCC TCATGACGCC GGAACTTGAC GCAGAAGATG GCTGGTTTCA TCCGAAGACC GGACGTTATA TTGCTGCCTG GCTCCCCAAC TCGTTCGGCT CCGAAAATCG TGAGTCGTTT GAGGCGAATG CCGATATTCT CGATGAGATC AGCCCGTTCT GGTACTCGCC GTCGCCTGGC GGCGAGCTGC GGTTTGGGCG CGAGGCGCGC GACCGCACGC TGCTCGAACT GGCGCATGGC AAAAACGTCC TCGTCATTCC AACCGTCCAC AACGTCGTTA CCGGCGAAGA CCCGGTGCCG GGCATTCTGC GCAATCCGCG CCTGCGGTCG TACCACGTGC AGCAGATCGT CGATGAGGTG TTGACCTATG GCTACGACGG GATCGATATC GACTATGAGT TTCTCAGCAG CAGTCTGCGC GATGACTATA GCGCCTTTAT TCTGGAACTG GCGGATGCGC TGCATGCGCA TGGCAAACTG CTGACGGTCG CTGTTCATGC GAAAGATTGT GATTATTGCG GACTGGGAGG TTTTCAGGAT TGGGCTGTGA TCGGTCAGGT GGTTGACCGG TTGCGCATTA TGACCTACGA TTACCACTGG CGCGGCGGCG GTCCTGGTCC GGTGGCGCCG GTCTATTGGG TCGAACGGGT GGCGCGCTAC GCGGTGACGG TCGTTGATCC GGCGAAGGTG GTGATCGGCG TGCCGTTCTA CGGCTACAAC TGGTCCCGTG ACGGCAGCGG CAATGCGCGC GGGCAGACGT GGGCGATGAT TAATGAGATC ATTCAAACCT ACCGCCTGTC GGTCAATCTC ATGGAGAGCA ATCAGAATGG TCTGGTGCAG GAAAACTGGA TCACCTACAG TTCGCGCACC GAAGGACGGC GTGAGGTCTG GTTCGCTACG AGCAGCGGTC TCGACGCAAA ACTGCGCCTG GTGCAGGAAC TCGATCTGGC GGGGATTGCG ATCTGGCGGC TCGGCGGCGA AGACCCGCGC AACTGGGAGA TCATTCGCGC GCGCCTGCTC CAGGACCCGT ATGAGTCGCA GCGGGTGTTG AGTCGCCTTT TGCCGGAGCA TTGA
|
Protein sequence | MRHVFSSPLA ASLYAVALIV WALLVVDTVS LARRATEPPI FPTPTTVAVV TPVAATRVAA PTSMPPVQAT VVPLMTPELD AEDGWFHPKT GRYIAAWLPN SFGSENRESF EANADILDEI SPFWYSPSPG GELRFGREAR DRTLLELAHG KNVLVIPTVH NVVTGEDPVP GILRNPRLRS YHVQQIVDEV LTYGYDGIDI DYEFLSSSLR DDYSAFILEL ADALHAHGKL LTVAVHAKDC DYCGLGGFQD WAVIGQVVDR LRIMTYDYHW RGGGPGPVAP VYWVERVARY AVTVVDPAKV VIGVPFYGYN WSRDGSGNAR GQTWAMINEI IQTYRLSVNL MESNQNGLVQ ENWITYSSRT EGRREVWFAT SSGLDAKLRL VQELDLAGIA IWRLGGEDPR NWEIIRARLL QDPYESQRVL SRLLPEH
|
| |