Gene Information |
Locus tag | Rcas_3863 |
Symbol | |
ID | 5541367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5048026 |
End bp | 5049651 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895972 |
Product | alpha amylase catalytic region |
Protein accession | YP_001433917 |
Protein GI | 156743788 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
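The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # Assumption: the final codon is a stop codon and is not translated.
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
length_bp = gene_length(5048026, 5049651)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields.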
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00249861 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.684332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGC AACCGGCTGG GCATCTCTGG TGGCAGCGCG GTGTTATCTA TCAGATCTAT CCTCGTTCGT TCCAGGACAG CAATGGCGAT GGCGTCGGTG ATCTGCGCGG CATTCGCTCG CGCCTCGATT ATCTGGTCGA TCTGGGTATC GATGCCATCT GGCTGTCACC GATCTTTCCA TCACCCATGG CCGATTTTGG CTACGATGTC GCCGATTACT GCGATATTCA CCCACTCTTC GGCACACTTG CCGACTTCGA TGCGCTGGTT GCCGATGCGC ATCGGCGTAA TCTGAAGGTC ATCCTCGACT TTGTGCCCAA CCACACATCG GATCAGCACC CATGGTTCAT CGAGTCGCGC AGTTCACGCG ACAATCCGAA GCGCGATTGG TACATCTGGC GTGATCCGGC GCCCGATGGG GGTCCGCCGA ACAACTGGCT TTCGTATTTT GGGGGGTCCG CCTGGGAATA CGATGCCACC ACCGGTCAGT ATTACCTGCA TCTCTTCCTC AAAGAGCAAC CCGATCTGAA CTGGCGCAAC CCACAGGTAC AGGCGGCAAT GCTCGATGTC ATGCGCTTCT GGCTTGACCG TGGCGTAGAT GGATTCCGCG TCGATGTGAT GTGGCTGATG ATCAAGGATG CACAGTTCCG CGACAATCCG CCCAATCCTG CCTGGAAACC TGGCATGATG CCGCATATGC GCATCCTTGA AGCCTGGTCC GCCGATCAAC CGGAGGTTCA CCAGATTGTG GCCATGATGC GACGGGTGCT CGACTCCTAC GACGAGCGGA TGATGGTCGG CGAGATCTAT TTGCCGTATG ATCGCCTGAT GCACTACTAT GGAACGCCAG AATCTCCCGA AGCACATCTT CCGTTCAATT TTGCGCTGGT TCTGTTGCCG TGGGACGCAC ACACGATTGC GCAGACGATT GCCGCGTATG AAGCATTGCT CCCACCTCAC GGATGGCCCA ACTGGGTGCT GGGGAACCAC GACCAACCCC GAATCGCCAG TCGAGTGGGT GAAGCGCAGG CGCGTGTTGC TGCCATGCTG CTGCTGACGC TACGCGGCAC GCCGACGATG TACTACGGTG ATGAGATCGG CATGCGCAAT GTGCCGATTC CGCCTGATCG TGTGCAAGAC CCGTTCGAGA AAAATGTGCC CGGCGAAGGG CATGGACGCG ATCCGCAGCG CACGCCAATG CAGTGGGATG CCAGCGAGTA CGCCGGGTTC AGCAAGGTTC AGCCATGGTT GCCCCTCGCC GACGATTACC GGCAGCGCAA TGTGGCAGCC CAGCGCAATG CACCGCATTC GATGCTCTCA CTCTACCGGC GTTTGCTGAC GCTGCGCCGT TCTGAACCGG CGTTGTCAAT CGGTTCCTAC CAGGCGATTG CTGTAGAAGG CGATGACACT GCGCGCCAGT CGGTGCTGGC ATTTGTGCGT GAGGTGAACG GCTGTCGTTT TCTGGTCGCC CTCAACTTTG CGTCGCATCC TGCGCGTCTG AGCCTTACGA CAATCGGTGA GGGCACGATT GCGCTCTCAA CTCACCTTGA TCGCGGTGGT AGCACCGTTC AAGGCGATCT CGAGTTGCGC GCCGACGAAG GGGTGATTAT TGCGCTGGCG CCATGA
|
Protein sequence | MTKQPAGHLW WQRGVIYQIY PRSFQDSNGD GVGDLRGIRS RLDYLVDLGI DAIWLSPIFP SPMADFGYDV ADYCDIHPLF GTLADFDALV ADAHRRNLKV ILDFVPNHTS DQHPWFIESR SSRDNPKRDW YIWRDPAPDG GPPNNWLSYF GGSAWEYDAT TGQYYLHLFL KEQPDLNWRN PQVQAAMLDV MRFWLDRGVD GFRVDVMWLM IKDAQFRDNP PNPAWKPGMM PHMRILEAWS ADQPEVHQIV AMMRRVLDSY DERMMVGEIY LPYDRLMHYY GTPESPEAHL PFNFALVLLP WDAHTIAQTI AAYEALLPPH GWPNWVLGNH DQPRIASRVG EAQARVAAML LLTLRGTPTM YYGDEIGMRN VPIPPDRVQD PFEKNVPGEG HGRDPQRTPM QWDASEYAGF SKVQPWLPLA DDYRQRNVAA QRNAPHSMLS LYRRLLTLRR SEPALSIGSY QAIAVEGDDT ARQSVLAFVR EVNGCRFLVA LNFASHPARL SLTTIGEGTI ALSTHLDRGG STVQGDLELR ADEGVIIALA P
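The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the CDS check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative starts.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to check the whole gene.
prefix = clean_sequence("ATGACCAAGC AACCGGCTGG")
print(len(prefix), gc_fraction(prefix))
```

Applied to the full sequence, `gc_fraction` should come out close to the 59% reported above if the record is internally consistent.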