Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3915 |
Symbol | |
ID | 5541421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5116383 |
End bp | 5118371 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640896025 |
Product | alpha amylase catalytic region |
Protein accession | YP_001433968 |
Protein GI | 156743839 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0962846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCAA GGGTCCTGGC GGTGATTTTA ATAATGAAGG ACATGCCCAT GAACTTCATC CGTGACACCC TGAGTCGGCC ACGCCCGCCA AGTGTCAGGC GCTCGGTGCA ACTGCCGCGT CGCGTGACGT ACTACCCCTC TCCGGTCGAC TGGCGCGATG AGGTGATCTA TTTTCTCCTC GTTGATCGAT TCAGCGATGG TCAGGAAGAG ACCCGCCCGT TGCTCGACCG GCGCTATCTG GCGGCAGCGC GCCCGACACT GCCCAATGGT GAGCCGTGGC GCTGGGACCG CTGGGCAGTG TCGGGCGGTG AACGGTTTCA AGGCGGAACG CTGCGCGGCG TGGTCTCAAA ACTCGGCTAC CTGCACCGGC TCGGCGTCAC CACTCTGTGG CTCAGCCCGG TCTGCAAGCA ACGCGGTCAC CTGAACACCT ACCACGGCTA CGCAATCCAG GATTTTCTCG ATGTTGATCC GCGGTTTGGC ACGCGCCAGG ACCTCGTCGA TCTCGTCAGC GCCGCGCATG AACAGGGAAT GCGCGTGCTA CTCGACATTG TGTTCCAGCA CTCCGGTCCC AACTGGCGCT ACCCGCCGGA TGTTCCTGGC GGCGCGGAGA TGCCGCGCTA CACGACTGGA CGTTATCCGT TCGGCAGTTG GTTGGACGCG ACCGGCGCGC CGCTCCTGGG CGTTCCCGAT GTCGATGATG CTGCCTGGCC CGATGAGATG CGCAATGTCG GGTACTACAC GCGGGCTGGC GCCGGAGATT TGGGCGCTGG CGATCTCAAT GATCCGGCGG CAGAGCACAA GCGCTCCGAT TTCTTTACGC TGCGCGACAT CGATCTCGAC GCGCCCGGCG CGCTGACAGA CATGGCGCTC TGCTACAAAT ACTGGATTGC GCTCACCGAC TGTGATGGGT TTCGCCTCGA CACGCTCAAA CACGTCTCGT TCGAGCAGGC GCGCAACTTC TGTGGCACAA TCAAAGAGTT TGCCGCCAAC CTGGGCAAGA CCAACTTCTT CCTGGTTGGC GAGGTGGCGG GCGGCGATTT TGCTGCGACG CGCTACCTCG ATGCGCTGGA ACGCAACTTG AACGCGGCAC TCGACATTGG CGAAATGCGG TTGGCGCTGA GCGACGTGGC GAAAGGTCTG GCGCCAGCAC GCGCCTATTT CGACGGATTT GTTCCCGGGC TTGCCATTAT GGGGTCACAC CGCAACCTCG GCAGTCGGCA CATCTCGATC CTCGACGATC ACGATCACGT TTTTGGTGCG AAACTGCGCT TTTCGACCGA TGTTATGGCG CAGCATCAGG CTGCGGTTGC GGCGGCGTTG CAATTGTTCA CGCTCGGCAT TCCGTGCATC TACTACGGCA CAGAACAGGC GCTCGGTGGT CCAGAGCTGT CGGAGCGCCG CTGGCTGCCG GAGTGGGGGC GTGCCGATCG TTACCTGCGT GAGGCGATGT TTGGTCCGCT GCATCCGCGT GCATCGGAAC GCGCTGGACT CGATCCACAG GCGCGTGATG GGTCGTTGCC GGGGTTTGGT CCGTTCGGCA CTGCCGGGAG TCACTGCTTC GATGAGCGGT TCCCGGTCTA TGTGCGGATT GCAGCGCTGA GTGCGTTGCG TGCCGCCTAT CCGGTATTGC GCCATGGGCG CCAGTACCTG CGCCCGATCT CGAATTTCAA CCAGCCATTC GCTTTTCCGC CAGCCGGAGA AATCATCGCC TGGTCGCGCA TTCTCGACGA CGAAGAAGCG CTGTGCGTCA TCAACCCACA CGGCGTTGCC GCGCGCGGCG GTGATGTCGT GGTCGACGCG GCGCTGAACC GTCCTGGCGA TATGCTGACC ATCATCTTAA ACACAGCGCA ATCTGCTGAT CCGCTGGGGT ACGTCGGACC ACACCCGGTC GGTCAGCGTC TGCCGGTTCG AGAGCGCAAC GGCGCGTCGT ATGTTGAGAT TCGCAACTTG CCGCCAGCCG AAACGCTGGT GTTGACCAAT CGACCATAA
|
Protein sequence | MISRVLAVIL IMKDMPMNFI RDTLSRPRPP SVRRSVQLPR RVTYYPSPVD WRDEVIYFLL VDRFSDGQEE TRPLLDRRYL AAARPTLPNG EPWRWDRWAV SGGERFQGGT LRGVVSKLGY LHRLGVTTLW LSPVCKQRGH LNTYHGYAIQ DFLDVDPRFG TRQDLVDLVS AAHEQGMRVL LDIVFQHSGP NWRYPPDVPG GAEMPRYTTG RYPFGSWLDA TGAPLLGVPD VDDAAWPDEM RNVGYYTRAG AGDLGAGDLN DPAAEHKRSD FFTLRDIDLD APGALTDMAL CYKYWIALTD CDGFRLDTLK HVSFEQARNF CGTIKEFAAN LGKTNFFLVG EVAGGDFAAT RYLDALERNL NAALDIGEMR LALSDVAKGL APARAYFDGF VPGLAIMGSH RNLGSRHISI LDDHDHVFGA KLRFSTDVMA QHQAAVAAAL QLFTLGIPCI YYGTEQALGG PELSERRWLP EWGRADRYLR EAMFGPLHPR ASERAGLDPQ ARDGSLPGFG PFGTAGSHCF DERFPVYVRI AALSALRAAY PVLRHGRQYL RPISNFNQPF AFPPAGEIIA WSRILDDEEA LCVINPHGVA ARGGDVVVDA ALNRPGDMLT IILNTAQSAD PLGYVGPHPV GQRLPVRERN GASYVEIRNL PPAETLVLTN RP
|
| |