Gene Rcas_1998 details


Gene Information

Locus tag: Rcas_1998
Symbol:
ID: 5539476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2565164
End bp: 2566609
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 60%
IMG OID: 640894133
Product: alpha amylase catalytic region
Protein accession: YP_001432104
Protein GI: 156741975
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
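
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below is illustrative only and not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2565164
end_bp = 2566609

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1446 bp
print(protein_length, "aa")  # 481 aa
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.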


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.722533
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.515841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCC ACACGCCCGA CTGGGTCAAG CACGCAGTTT TCTATCAGAT TTTTCCTGAC 
CGTTTCGCCA AAAGCGAGCG GGTTGCCAAA CCTAACCACC TCCAGCCGTG GGATAGCCTG
CCAACGCCAG AAGGGTACAA AGGAGGCGAT CTTCTGGGAG TGATCGAACG ATTGGACTAC
TTGCAAGACC TGGGGATCAC CGCGATCTAC TTTACGCCGA TTTTCCAATC GGCATCGAAC
CATCGCTACC ACACCCACGA CTACTACCAG GTCGATCCGA TGCTCGGCGG CAACGAAGCG
TTTCGTGCGC TGCTGGAAGC CTGCCACCGG CGTGGGATGC GTGTCGTGCT CGATGGTGTG
TTCAACCACG CCAGTCGCGG CTTTTTCCAG TTCCACGACA TTCTTGAGAA TGGACCGTTT
TCCGCTTACC TGGATTGGTT CTTCATTGAG GGATGGCCCC TCAGCCCCTA CGATGGATCA
CGCCCGGCAA ACTATCGCGG TTGGTTCAAC AACCGGGCGT TGCCGAAGTT TAATACCGAC
AACCCACAGG TGCGCGAATT TCTGATGCGC GTCGCCGAAC ACTGGATCCG CCAGGGCATC
GACGGTTGGC GCCTCGATGT GCCGTTCGAG ATTACGACCG AGGGATTCTG GCAGGAGTTT
CGCCAGCGTG TGAAGGCGAT CAACCCCGAA GCGTACATTG TCGGTGAGGT ATGGCGCGAT
GCGCGCCGGT GGTTGCAGGG CGACCAGTTC GATGGCGTGA TGAACTACCT GTTCACCGGA
CCGACGATTG CGTATGTCGC CGGTCCGCGC GTCGATCCGG CGCAGGTCGT GGGGCGGGAT
TATGTCACCA TGCCGCCGTT GACGGCAGCG GAGTATGCGC GTGTGATCGG CGATGTGCTG
AGCCGGTACG ATTGGGAGGT CCAGTTGACC CAGTTGAACC TGTTCGATAG CCACGACACG
GCGCGCCTGC TGACGATTGC GCGCGGCGAC CGCAGCAGCG TGCGCCTGGC GACGATCCTG
CTCATGACGT TCCCTGGCGC GCCGTCGGTA TTCTACGGCG ACGAAATCGG ACTGCCCGGC
GGCGTCGATC CCGACGCGCG CCGCGCGATG CCATGGGATC GACCGGAGAC GTGGGATATG
GAGACGCTGG CGTACCACAA ACAGTTGATC GCTCTGCGCC ATGCGCTCCC GGCGCTGCGC
ACCGGTGCAT TCCACGTCCT GTACGCCGAC GATGAGGTCT TCGCCTTTGC ACGCAAACTG
AACGACCAGA TCGTGATCGT GGCGGTCAAC AACGACGAAC AGCCACGGCA GGTCGCCATT
CCGGTCGCCG ACCTTTTGTC CGACGGTCGT GAGATGATCG CCCGCTATGG GAACTATCAC
AACCGGATTG TTCATGGTAC ACTTCATGTA TCTGTGCCGG CTCGCGACGG GCTGATCCTG
ACATGA
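
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. As a minimal sketch (not part of the record), the hypothetical helper gc_percent below computes GC content for a sequence in this layout. It is applied only to the first line of the sequence; it does not reproduce the 60% figure reported for the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the GC percentage of a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used for the 10-base grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, in the same 10-base grouping.
first_line = "ATGACCATCC ACACGCCCGA CTGGGTCAAG CACGCAGTTT TCTATCAGAT TTTTCCTGAC"
print(gc_percent(first_line))
```

Passing the full multi-line sequence as one string works the same way, since str.split() drops line breaks as well as spaces.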
 
Protein sequence
MTIHTPDWVK HAVFYQIFPD RFAKSERVAK PNHLQPWDSL PTPEGYKGGD LLGVIERLDY 
LQDLGITAIY FTPIFQSASN HRYHTHDYYQ VDPMLGGNEA FRALLEACHR RGMRVVLDGV
FNHASRGFFQ FHDILENGPF SAYLDWFFIE GWPLSPYDGS RPANYRGWFN NRALPKFNTD
NPQVREFLMR VAEHWIRQGI DGWRLDVPFE ITTEGFWQEF RQRVKAINPE AYIVGEVWRD
ARRWLQGDQF DGVMNYLFTG PTIAYVAGPR VDPAQVVGRD YVTMPPLTAA EYARVIGDVL
SRYDWEVQLT QLNLFDSHDT ARLLTIARGD RSSVRLATIL LMTFPGAPSV FYGDEIGLPG
GVDPDARRAM PWDRPETWDM ETLAYHKQLI ALRHALPALR TGAFHVLYAD DEVFAFARKL
NDQIVIVAVN NDEQPRQVAI PVADLLSDGR EMIARYGNYH NRIVHGTLHV SVPARDGLIL
T