Gene Rcas_4246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4246 
Symbol 
ID5541757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5488068 
End bp5489849 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content63% 
IMG OID640896353 
Productalpha amylase catalytic region 
Protein accessionYP_001434291 
Protein GI156744162 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.286976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAC GCGCACTCTT TCTGATCATC GTTCTTATCA TCGTTGGGTG CGGTGCGCCT 
TCTGCGGCAG TTCCGCCAAC TGCCACACCG CCTGCCGCAC CTTCCGCTAC GCCGTTAAGC
GAAGAAGCAG CGCTGCTCGC CACCATGGCG GCGAATGCGC CCTCTCCCGC AGAGGCGACC
CCCACCGTCC CACTGCCGAC CCTGTTTCCG ACTATCACCC TCGGACCGAC TGCGGAACCG
CGACCGTTGC CGGCCGGATG GTGGGACACC GCCGTGTGCT ATGAAATCTT CGTGCGCTCG
TTCTACGACA GCGACGGCGA TGGGATCGGC GATATCAACG GGCTGATCGC AAAACTCGAC
TACATCAACG ATGGCGATCC AACCGGCGGT TCCGACCTTG GCGCCAACTG TATCTGGCTG
ATGCCGGTGG CGGAAGCCGC CAGCTACCAC GGTTACGATG TCATCGACTA TAACGCCATC
GAAAAAGATT ATGGCACAAA CGACGATTTC AAGCGTCTTG TGGCAGAGGC GAACCGGCGC
GGCATCCGGG TGATCGTCGA TCTAGTGTTG AACCACACAT CCAGCGCGCA TCCCTGGTTT
ATCTCGGCGC TGAACGATCC TGCATCTCCC TATCGCGACT GGTACATCTG GTCGCCGGTC
GATCCCGGCT ACCGCGGACC GTGGGGGCAA CAGGTCTGGC ACCGTTCACC GGCGCGCAAC
GAGTACTACT ATGGCGTTTT CGTCGCCGAA ATGCCCGATC TGAACTACCG CAACCCGGAA
GTCGTCGCTG AAGCGGAGAA GATCGCCGCC TTCTGGCTGA ACGAGATGGG AGTCGATGGG
TTCCGCCTCG ATGCGATCAA GCACATTGTG GAAAACGGTT CCGAGCAGGA AGGCACGCGC
GAAACCCATG CCTGGATGCG CTCATTCGAG GCAGCCATCG AGCGCATCAA ACCGGGAGCG
TTCACCGTTG GCGAAGTCTT CGGCGGGCGC GCCGGGTCGC TTGACGCCTA CTATCCCGAC
CAACTCGACA CCTACTTCGA GTTTGGCGTT GCGGAAGGAA TCTTGCGCTC CGCCAACACC
GGCGCGCCGG GACCGTACCT GACGGCGGTG GAAGATGCGC TCATGCGCCT TCCATACCAG
CGTTGGGCGC CATTCCTCAC CAACCACGAC CAGGAACGGG TGATGACTGT CCTCGGCGGC
GATGCCGGTA AGGCGCGCGT TGCGGCCATC GCCCTGCTGA CTCTTCCGGG TCTCCCATTT
GTGTACTATG GCGAGGAGAT CGGTATGACG GGCGCCAAAC CGGATGAGCG CATCCGCACG
CCGATGCAGT GGACGGGCGA ACCGGGAGCG GGATTCACGA CCGGTTCGCC CTGGCAGGCG
CCCCAGAGCG ATTTCCCGAC GGTCAACGTC GCCGCGCAGG ATACCGATCC CGACTCGCTG
CTCAACGTCT ATCGGACCCT CATCCGGTTG CACACCACCC GTCCGGCGCT CGGCAAGGGC
GATTTCACCG CGCTGAACGC GACCGGCGGC GCAGCGGCGT TCCTGCGGCG CAGCGGCGAC
GACGCGGCGC TGGTGGTGAT CAACTTCAGC GCCAACCCGC TCTCCGGCGT GACCCTCTCG
ACTGCGCAGA GCAATCTCGA TCCCGGCGCC TACACCCCTG AACTGCTCTT CGGCAGTGGA
AACCTGTCGC CGCTCATCGT CGGCGCCAAC GGGTCAATCA GCGCGTATGC GCTGCCTGAG
ATTCCGCCGC AGCGCGCGGT GATTGTTGGG TTGGGGAGAT GA
 
Protein sequence
MKARALFLII VLIIVGCGAP SAAVPPTATP PAAPSATPLS EEAALLATMA ANAPSPAEAT 
PTVPLPTLFP TITLGPTAEP RPLPAGWWDT AVCYEIFVRS FYDSDGDGIG DINGLIAKLD
YINDGDPTGG SDLGANCIWL MPVAEAASYH GYDVIDYNAI EKDYGTNDDF KRLVAEANRR
GIRVIVDLVL NHTSSAHPWF ISALNDPASP YRDWYIWSPV DPGYRGPWGQ QVWHRSPARN
EYYYGVFVAE MPDLNYRNPE VVAEAEKIAA FWLNEMGVDG FRLDAIKHIV ENGSEQEGTR
ETHAWMRSFE AAIERIKPGA FTVGEVFGGR AGSLDAYYPD QLDTYFEFGV AEGILRSANT
GAPGPYLTAV EDALMRLPYQ RWAPFLTNHD QERVMTVLGG DAGKARVAAI ALLTLPGLPF
VYYGEEIGMT GAKPDERIRT PMQWTGEPGA GFTTGSPWQA PQSDFPTVNV AAQDTDPDSL
LNVYRTLIRL HTTRPALGKG DFTALNATGG AAAFLRRSGD DAALVVINFS ANPLSGVTLS
TAQSNLDPGA YTPELLFGSG NLSPLIVGAN GSISAYALPE IPPQRAVIVG LGR