Gene Information |
Locus tag | Rcas_4167 |
Symbol | |
ID | 5541678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5390600 |
End bp | 5392324 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640896278 |
Product | alpha amylase catalytic region |
Protein accession | YP_001434216 |
Protein GI | 156744087 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.718668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00459901 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATTCGC TCAAATGGTG GCAAACGACG GTGTTCTATC AGATCTATCC GCGCAGTTTC GCCGATGGGA ACGGAGATGG CATCGGCGAT TTCGCCGGGA TGATCGACCG GCTCGATTAC CTGCGCGACT TGGGAGTCGG TGCGCTCTGG CTCTCGCCAC ACTATCCTTC ACCCAATGCC GACTGCGGCT ACGACATCTC CGATTACACC GGCGTTGCCC CAGAGTACGG CACACTGGAC GATTTTCGAC GTTTTCTGGA AGGCGCGCAC GCACGCGGTA TGCGCGTCCT GCTCGACCTC GTCCTCAACC ACACGTCCGA AGATCACCCC TGGTTCCGGG AGTCACGCTC CAGCCGCAAC AATCCAAAGC GCGACTGGTA CATCTGGCGC GATCCGGCGC CCGACGGCGG ACCGCCGAAC AACTGGTATT CGGCGTTTGG CGGTTCTGCC TGGACGTTCG ACGAAGCGAC CGGGCAGTAT TACTATCACT TTTTCTTCAA GGAACAGCCC GACCTGAATT GGCGCAACCC GGATGTCAAA CAGGCGATGT GGCATGCAGT GCGGTTCTGG CTCGATATGG GGGTGGATGG GTTCCGCCTC GATGCGATTG ACACGATCTT CGAAGACCCG AACCTGACGC CGCAGCAGTC GAAATTGTCG CAGATCGAGA TGCTGCGCAT CTGGCGCGAG AATCGTCCGC CGGAGGAGAC AAAAGAACTC TGGGAACAGT TCGCCCTGAT GTTCCGCCAC CAGGTGCAGC AGCCAGGATT GCACGAGTTG ATGAAAGAAC TACGCGCATT AGTGGACGAA TATCCAGGTG ATCGGGTGCT GATCGGCGAG GGTGACGATA TTGCGTACTA CGGCAACGGC CATGATGAGT TACACTTGGT GTTCAATTTT CCGCTCATGC GCACCAATCG GTTGACGCCT GCCTGGATCC GCGCCAATCA GGCGGAACGA CTGGCAGCGT TGCCGCCTGG CGCATGGCCC TGCAACACGC TGGGAAACCA TGACGTTGGC CGCATGTGGA CGGCATATGG CGACGGCGTC CACGATGCAG CGCTTGCCCG CTTGCATGTG ACGATGCTGC TGACACTGAA AGGCACACCG GTGCTCTACA ACGGCGAGGA GATCGGCATG ACCGATCTGC TGCTCGAACG GTTCGACCAG TTGCGTGACA ATCAGGCGGT TAACTTGTAT GACGCAGCGG TCAACGATGG CATTCCTGCC GATGAAGCGA TGAGGATGGC GGCAAAGATC AGCCGGGACC GCTGTCGAAC GCCGATCCAG TGGGCAAATG CGCCAAACGC CGGTTTCAGC CCGGCCGGTG TGACGACCTG GCTGCCGGTC AATCCGAACT ATGCTCAGGG AGTAAATGTC GCCGAACAGG TCGGCGATCC TCACTCGCTC CTCACCTTCT ATCGTCGCCT GATCGCCGCG CGCCAGGCGA CTCCCGCACT GTTGGAAGGC GATTACACGC CACTGCATCC AAACGAGGAG CGTTATCTGG CGTTTCTGCG CACCACACCG GAACAGCGCT GCCTTGTGGC GCTGAACTTT ACGGCTGAAC CGGTGACGGC AAGTTTTGCA CCTGGCGAAG ATTCACTTTT ACGCACGATT TTCTCGACCC ATCCGCGACC GGCAGGCGAA GAAAACCCGG CACACCTGAC ACTGGCGCCG TTCGAGGCAT ACATTGGAGA GATTCTGCCA GAGCGTCAGG GATAA
|
Protein sequence | MHSLKWWQTT VFYQIYPRSF ADGNGDGIGD FAGMIDRLDY LRDLGVGALW LSPHYPSPNA DCGYDISDYT GVAPEYGTLD DFRRFLEGAH ARGMRVLLDL VLNHTSEDHP WFRESRSSRN NPKRDWYIWR DPAPDGGPPN NWYSAFGGSA WTFDEATGQY YYHFFFKEQP DLNWRNPDVK QAMWHAVRFW LDMGVDGFRL DAIDTIFEDP NLTPQQSKLS QIEMLRIWRE NRPPEETKEL WEQFALMFRH QVQQPGLHEL MKELRALVDE YPGDRVLIGE GDDIAYYGNG HDELHLVFNF PLMRTNRLTP AWIRANQAER LAALPPGAWP CNTLGNHDVG RMWTAYGDGV HDAALARLHV TMLLTLKGTP VLYNGEEIGM TDLLLERFDQ LRDNQAVNLY DAAVNDGIPA DEAMRMAAKI SRDRCRTPIQ WANAPNAGFS PAGVTTWLPV NPNYAQGVNV AEQVGDPHSL LTFYRRLIAA RQATPALLEG DYTPLHPNEE RYLAFLRTTP EQRCLVALNF TAEPVTASFA PGEDSLLRTI FSTHPRPAGE ENPAHLTLAP FEAYIGEILP ERQG
|
| |
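The coordinate, length, and translation fields above can be cross-checked with a short script. The following is a minimal sketch, not part of the original record: the function names `gene_length` and `translate` are hypothetical, and the codon table is built from the NCBI amino-acid string for translation table 11, which for elongation codons assigns the same residues as the standard code. It assumes inclusive 1-based coordinates and that the annotated gene length includes the stop codon. Only the first 30 and last 15 nucleotides of the gene sequence are translated here.

```python
# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# The amino-acid string follows NCBI's T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ("*" marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace between the 10-nucleotide blocks is ignored. Alternative
    start codons (e.g. GTG, TTG) are NOT special-cased in this sketch;
    this gene starts with ATG, so that does not matter here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Values copied from the record: Start bp, End bp.
length = gene_length(5390600, 5392324)
codons = length // 3

# First 30 and last 15 nucleotides of the gene sequence field.
first_residues = translate("ATGCATTCGC TCAAATGGTG GCAAACGACG")
last_residues = translate("GAGCGTCAGG GATAA")

print(length, codons - 1, first_residues, last_residues)
```

Under these assumptions the computed length agrees with the Gene Length field, the codon count minus the stop codon agrees with the Protein Length field, and the translated fragments agree with the start and end of the Protein sequence field.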