Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2968 |
Symbol | |
ID | 5540459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3850702 |
End bp | 3852279 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640895087 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_001433045 |
Protein GI | 156742916 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.34566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000602964 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCTATCA AGACGGGAAC GATTGCGATA TTGAGTCGAA ACGGCCGTTT AGTTAACGAG TCGTCTATGA GACCTTTTCC ACGCCGCAGC GGGGTGATCC TGCATCCGAC ATCACTTCCT GGACCATACG GCATCGGCGA CCTTGGCGAT GGGGCGTATC GCTTTGTCGA TTTTCTCGCC GCTGCCGGGC AGACGTACTG GCAGGTCCTG CCGCTCAGTC CGACCGGATA CGCCGACTCG CCCTACCAGG GAATTTCGGC GTTCGCCGGT AATCCGCTGA TGATCAACCC GGACCGCCTG ATCGCCGATG GTCATCTGAC GTCTGCCGAT CTGGTCGATC GTCCGGCGTT CTCTGATGAT CGGGTCGATT TTGGCGCTGT GATTGGTTGG AAATTCGCGC TGCTCAATCG CGCCTTTGCG CGCTTTCAGG CGACGCCATC CACCAACCGC GCGCGGTTCG AGCAGTTTTG TGATGAGCAG GCAGCCTGGC TCGACGAGGC GGCGCTGTTT ATGGCGCTGA AACAGGCGCA TGGCATGCGC GCCTGGACGG AGTGGCCTCT GGCGCTTGCA GCGCGCGACC CGGACGCGCT GGCGCTGGCG CGTGCTGAAC TTGCCGATGT GGTCGAAGCG CACAAATATT TCCAGTGGCT GTTCTTCACA CAGTGGCAGG AATTGCGACA CTACGCCAAT CGTCGCGGCA TCCGCATTAT TGGCGATGTG CCGATCTTCG TCGCGCTCGA CAGCGCCGAT GTCTGGGCGA ATCCACATCT GTTCTGCCTC GATGCAAACC TGCGCCCAAC CGTCGTTTCT GGCGTACCAC CTGATTATTT CAGTGAGACA GGGCAACTCT GGGGGCATCC GCTCTATCGG TGGGATGTCA TGGCTGCCGA TGGGTATCGC TGGTGGATCG ACCGCTTTCG CGCTTCGTTT ACGCTTGTCG ATGTCGTGCG TATCGACCAC TTTCGCGGTT TCTACAACTA CTGGGAAATT CCGGCAGGTG AAACGACTGC GATCAACGGT CGCTGGGTCG ATGGTCCGCG CGCCGATCTG TTCATGGCGG TCACCGCAGC GCTCGGCGAG GTGGCGATCA TCGCCGAAGA CCTTGGCGAT TTTACGCCTG AGTCGCGCGC CGGTCTCGAT GCGCTCATGG CACAGTTTGG CTTTCCGGGA ATGCGCATTC TTCAATTTGC CTTCAACCGC CGCGAAGGGG ACCGCTTCTT TCCGCACAAC TACCCGCGCG CCTGTGTCGT GTATACTGGA ACCCACGACA ACGACACCCT TGCCGGATGG TTCACCAACA GTTCGACCGA CGCCGAACGA CGTGATGCGC TGCGCTATCT CTGCGGCAGC GCCGACGATA TCGTCTGGGC ATTCATCCGC ACCGCCTGGA TGTCGGTCGC CGACACAGCC ATGACAACGG TGCAGGACCT GCTCGGTCTG GGGAGCGAGG CGCGGATGAA CCTGCCCGGC ACGCTTGGGT CGCATAACTG GACCTGGCGT GTGCCATCGG GAGCGCTTGA TCGGCATCGT GCCAGACGCC TGCGCGATCT GACGGAGATT TATCAGCGAT TGCCCTGA
|
Protein sequence | MSIKTGTIAI LSRNGRLVNE SSMRPFPRRS GVILHPTSLP GPYGIGDLGD GAYRFVDFLA AAGQTYWQVL PLSPTGYADS PYQGISAFAG NPLMINPDRL IADGHLTSAD LVDRPAFSDD RVDFGAVIGW KFALLNRAFA RFQATPSTNR ARFEQFCDEQ AAWLDEAALF MALKQAHGMR AWTEWPLALA ARDPDALALA RAELADVVEA HKYFQWLFFT QWQELRHYAN RRGIRIIGDV PIFVALDSAD VWANPHLFCL DANLRPTVVS GVPPDYFSET GQLWGHPLYR WDVMAADGYR WWIDRFRASF TLVDVVRIDH FRGFYNYWEI PAGETTAING RWVDGPRADL FMAVTAALGE VAIIAEDLGD FTPESRAGLD ALMAQFGFPG MRILQFAFNR REGDRFFPHN YPRACVVYTG THDNDTLAGW FTNSSTDAER RDALRYLCGS ADDIVWAFIR TAWMSVADTA MTTVQDLLGL GSEARMNLPG TLGSHNWTWR VPSGALDRHR ARRLRDLTEI YQRLP
|
| |