Gene Rcas_3863 details


Gene Information

Locus tag: Rcas_3863
Symbol:
ID: 5541367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5048026
End bp: 5049651
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 59%
IMG OID: 640895972
Product: alpha amylase catalytic region
Protein accession: YP_001433917
Protein GI: 156743788
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
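
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5048026
end_bp = 5049651

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length fields.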


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00249861
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.684332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGC AACCGGCTGG GCATCTCTGG TGGCAGCGCG GTGTTATCTA TCAGATCTAT 
CCTCGTTCGT TCCAGGACAG CAATGGCGAT GGCGTCGGTG ATCTGCGCGG CATTCGCTCG
CGCCTCGATT ATCTGGTCGA TCTGGGTATC GATGCCATCT GGCTGTCACC GATCTTTCCA
TCACCCATGG CCGATTTTGG CTACGATGTC GCCGATTACT GCGATATTCA CCCACTCTTC
GGCACACTTG CCGACTTCGA TGCGCTGGTT GCCGATGCGC ATCGGCGTAA TCTGAAGGTC
ATCCTCGACT TTGTGCCCAA CCACACATCG GATCAGCACC CATGGTTCAT CGAGTCGCGC
AGTTCACGCG ACAATCCGAA GCGCGATTGG TACATCTGGC GTGATCCGGC GCCCGATGGG
GGTCCGCCGA ACAACTGGCT TTCGTATTTT GGGGGGTCCG CCTGGGAATA CGATGCCACC
ACCGGTCAGT ATTACCTGCA TCTCTTCCTC AAAGAGCAAC CCGATCTGAA CTGGCGCAAC
CCACAGGTAC AGGCGGCAAT GCTCGATGTC ATGCGCTTCT GGCTTGACCG TGGCGTAGAT
GGATTCCGCG TCGATGTGAT GTGGCTGATG ATCAAGGATG CACAGTTCCG CGACAATCCG
CCCAATCCTG CCTGGAAACC TGGCATGATG CCGCATATGC GCATCCTTGA AGCCTGGTCC
GCCGATCAAC CGGAGGTTCA CCAGATTGTG GCCATGATGC GACGGGTGCT CGACTCCTAC
GACGAGCGGA TGATGGTCGG CGAGATCTAT TTGCCGTATG ATCGCCTGAT GCACTACTAT
GGAACGCCAG AATCTCCCGA AGCACATCTT CCGTTCAATT TTGCGCTGGT TCTGTTGCCG
TGGGACGCAC ACACGATTGC GCAGACGATT GCCGCGTATG AAGCATTGCT CCCACCTCAC
GGATGGCCCA ACTGGGTGCT GGGGAACCAC GACCAACCCC GAATCGCCAG TCGAGTGGGT
GAAGCGCAGG CGCGTGTTGC TGCCATGCTG CTGCTGACGC TACGCGGCAC GCCGACGATG
TACTACGGTG ATGAGATCGG CATGCGCAAT GTGCCGATTC CGCCTGATCG TGTGCAAGAC
CCGTTCGAGA AAAATGTGCC CGGCGAAGGG CATGGACGCG ATCCGCAGCG CACGCCAATG
CAGTGGGATG CCAGCGAGTA CGCCGGGTTC AGCAAGGTTC AGCCATGGTT GCCCCTCGCC
GACGATTACC GGCAGCGCAA TGTGGCAGCC CAGCGCAATG CACCGCATTC GATGCTCTCA
CTCTACCGGC GTTTGCTGAC GCTGCGCCGT TCTGAACCGG CGTTGTCAAT CGGTTCCTAC
CAGGCGATTG CTGTAGAAGG CGATGACACT GCGCGCCAGT CGGTGCTGGC ATTTGTGCGT
GAGGTGAACG GCTGTCGTTT TCTGGTCGCC CTCAACTTTG CGTCGCATCC TGCGCGTCTG
AGCCTTACGA CAATCGGTGA GGGCACGATT GCGCTCTCAA CTCACCTTGA TCGCGGTGGT
AGCACCGTTC AAGGCGATCT CGAGTTGCGC GCCGACGAAG GGGTGATTAT TGCGCTGGCG
CCATGA
 
Protein sequence
MTKQPAGHLW WQRGVIYQIY PRSFQDSNGD GVGDLRGIRS RLDYLVDLGI DAIWLSPIFP 
SPMADFGYDV ADYCDIHPLF GTLADFDALV ADAHRRNLKV ILDFVPNHTS DQHPWFIESR
SSRDNPKRDW YIWRDPAPDG GPPNNWLSYF GGSAWEYDAT TGQYYLHLFL KEQPDLNWRN
PQVQAAMLDV MRFWLDRGVD GFRVDVMWLM IKDAQFRDNP PNPAWKPGMM PHMRILEAWS
ADQPEVHQIV AMMRRVLDSY DERMMVGEIY LPYDRLMHYY GTPESPEAHL PFNFALVLLP
WDAHTIAQTI AAYEALLPPH GWPNWVLGNH DQPRIASRVG EAQARVAAML LLTLRGTPTM
YYGDEIGMRN VPIPPDRVQD PFEKNVPGEG HGRDPQRTPM QWDASEYAGF SKVQPWLPLA
DDYRQRNVAA QRNAPHSMLS LYRRLLTLRR SEPALSIGSY QAIAVEGDDT ARQSVLAFVR
EVNGCRFLVA LNFASHPARL SLTTIGEGTI ALSTHLDRGG STVQGDLELR ADEGVIIALA
P
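
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for elongation codons, and it translates only the first 60-base row of the gene sequence rather than the full record. The helper name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed in this record.
first_row = "ATGACCAAGC AACCGGCTGG GCATCTCTGG TGGCAGCGCG GTGTTATCTA TCAGATCTAT"
print(translate(first_row))  # MTKQPAGHLWWQRGVIYQIY
```

The output corresponds to the first 20 residues of the protein sequence above. Passing the full 1626-base gene sequence to the same function should stop at the terminal TGA codon.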