Gene Information |
Locus tag | Rcas_0466 |
Symbol | |
ID | 5537929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 594428 |
End bp | 596032 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640892629 |
Product | glycosyl hydrolase BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_001430615 |
Protein GI | 156740486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
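The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state this, but the reported values are consistent with it) and that the last codon of the CDS is a stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 594428
end_bp = 596032

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is assumed to be a stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1605, matches "Gene Length"
print(remainder)       # 0, the CDS is a whole number of codons
print(protein_length)  # 534, matches "Protein Length"
```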
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.125416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.453675 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTC TGACGATACT GATTGCCGGT CTTCTGGTAG TCATCAGTCT GGCGCTGTCG CACTCCCCGG CAGCGGCGCA GCGCAGCCGC TGGACGACGC CCTTTGAGGT GTCGCCGCCG GTCACGGAAG CGACGTCAGC GCAGGCAGCG CCAACTCCGC CTCCTGCCGG ACAGAAGGCG GTGCGCTATG GCTCATCGTG GTTTCCAAGC CTTGCCATCG GTCCAACGGG GAGCGTTCAT ATTGTATGGT ATGGTGGCAT TGTCACCACT GCCGGCAATG AGGGGTCTAT CGATCTCCTC ATGTACCGCG AGTTGCGTGA CGGCGTCTGG TCGCCGTTCA ATGATGTCGT GCAGACTGCA ACAGGCGGAT ATACCGTTCG CAACAGTATT GTTATGAGTC GTGACGGGAA ATTGCACGTC TTGCTGCGCA TGGGCACGAC GATCCGGCAT GTCAGCGTTC CATGGGATCG GGCGTGGACC GCAGCGGCCT GGAGCGAACC GCGAACGGTT GGCAGGAGCG GTTATTACGT GGCGCTAGGA GTGGACAGTC AGGCGCGATT GCATGCCTTC TGGTCGCAAG GAGTCCCGGA CGAACCGGGT AAGCCGCGTC CCGAATGTCC GAACTGCGCT AATCTGTTCT ATCGCTATTC CGATACTGGT GGCGAAAGCT GGTCGGAACC TGTGAACCTG TCGAATTCGC CCTATGGCGA TAACCGACCA CAAGTGAAAG TGGATAGTCG TGACCGGGTG CATGTCATAT GGGATCAGGG GAAAGACTGG TATGCCGGTC TGGGCAAACC AAAGCAGGGG GTGTACCGTC GCTCGGATGA TCGCGGCAAC ACCTGGCGAC CGCCGGTTTA TTTCACCGTT CCTGACGATG CGATCCAACA GACCACTCTC GGCATTGCGG CGGGCGATAA CCCGATCGTC GTCTATCGCA CGCTCAAGGG AACGCTCTAC TACCAGTACT CGCGTGATGG CGGCGATAGC TGGAGTCTGC CAGACGTGCT GCCCGGCGTT GTTGCGCGAG ATACCCGCGG GAATGATCTC GACCAGTACT CCATGGCTAC CGATAGTGAA GGGAATGTCC ATCTCCTGAT GGTTGGCTAC CCGACGACAA TATCCAGCAT CGCTCCTGGC GCCGCGCCGT CGATGGTGTT GCACCTGGTG TGGAATGGCT CGCGCTGGAG TTCGCCCGAA ATCGTGTCGG CAAACGAGTT CTATCCCGAA TGGCCCGTTA TTGCCGTGTC GAACGGCAAT AAACTCCACG CCGTCTGGTA TACACGCCGG GGAGACGGCA TAGCAGCAGA CGAGCGCGAC CGCTATCGTA TCTGGTATAG TTCGCGCCAG CTGGATCTTC CGGAAACTGC ACCGCTGCCA CTGTTTACGC CGGTTCCGAC GGCGGTGCCA ACCGCTGTGC CTACTGCAAC TCCAATTCCT CCCACGTCCA CTCCCCTTCC AACCGATGTT GCGACCGCGC CACCGATCAG CGGTCCTCCC CGCTGGGAGA CGACGGCGCT GCCGGTGCTG GCGCTGGCAC TGCTGCCGCC TATCGGGGTC ATTGCCGGTG TTTATGCCTT GCGTCGATGG GGGAAACGCC GATAG
|
Protein sequence | MKRLTILIAG LLVVISLALS HSPAAAQRSR WTTPFEVSPP VTEATSAQAA PTPPPAGQKA VRYGSSWFPS LAIGPTGSVH IVWYGGIVTT AGNEGSIDLL MYRELRDGVW SPFNDVVQTA TGGYTVRNSI VMSRDGKLHV LLRMGTTIRH VSVPWDRAWT AAAWSEPRTV GRSGYYVALG VDSQARLHAF WSQGVPDEPG KPRPECPNCA NLFYRYSDTG GESWSEPVNL SNSPYGDNRP QVKVDSRDRV HVIWDQGKDW YAGLGKPKQG VYRRSDDRGN TWRPPVYFTV PDDAIQQTTL GIAAGDNPIV VYRTLKGTLY YQYSRDGGDS WSLPDVLPGV VARDTRGNDL DQYSMATDSE GNVHLLMVGY PTTISSIAPG AAPSMVLHLV WNGSRWSSPE IVSANEFYPE WPVIAVSNGN KLHAVWYTRR GDGIAADERD RYRIWYSSRQ LDLPETAPLP LFTPVPTAVP TAVPTATPIP PTSTPLPTDV ATAPPISGPP RWETTALPVL ALALLPPIGV IAGVYALRRW GKRR
|
| |
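The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how the Gene sequence field could be cleaned, checked for GC content, and translated. It assumes the standard amino acid assignments, which NCBI translation table 11 shares with table 1 (the two tables differ in permitted start codons, which this sketch does not model). The function names `clean_sequence`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses these same amino acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the Gene sequence field, used here as a short demo.
demo = clean_sequence("ATGAAACGTC TGACGATACT GATTGCCGGT")
print(len(demo))        # 30
print(translate(demo))  # MKRLTILIAG
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared with the GC content and Protein sequence fields of this record.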