Gene Rcas_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2077 
Symbol 
ID5539557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2664273 
End bp2667968 
Gene Length3696 bp 
Protein Length1231 aa 
Translation table11 
GC content62% 
IMG OID640894212 
Productalpha amylase catalytic region 
Protein accessionYP_001432181 
Protein GI156742052 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGGG AGTCGAATGT GGAGATCAGC CGCATAACTG GCGCGCCGGT GATGGAGTTT 
CACATCGCCC GCAGCGCGCG CGACCGCTAC CATTTCGATG AGTCGCTGTT CCAGTTCAGC
GGCAATGTCA TCTTTGCCAA TTTTCACGCT GCGCGCGTCT TCGCGCAGAA GATGAACCAG
AAACGCGACC TGGCGCGCTT TCCAGAGCAG GCGGTGCGCG CCGGGCAGAT CAGCGCGATG
GGGCTGATCG ACGAAATCCT GCACTATGTC GCCGCCACGT ACCGCAAACG CACCCGCGAA
CACATGCTCG CCGACGCGCT TGCCTGGCTC AATGAGCGCC TGGGAGCGGA AAAGGTCGAG
ACGACGCTGC GCGCGTTCTG CGATGCGTTT CCGCCGCTTG CCGTCTACCG CCGGGAGCAG
GCGCTCGATG CGTACCTGGC TGACGAAACC GGCGGCATGC CCCATCGCGA AGTTGCGCTC
GAAGAGATGC TCATGCTCTG GCTGGAGAAT GTAAATCCAG CCTTCAATCC GTTCCTCGAA
CTGTTCGACG ACGCGCATCT TGAGCACGAT ACCGTCTATC TCAAGATGAT CGAGGAACTG
GATCGTTTTT TCGCCACGCA ACCGCCGCCA GCCGCCGTTC TGCCAGGCAT CACCCAACGC
ACCCTCATCG AACTGCTGCG CGCGCCGGCG CTGGCGTCGC CCCACTCTCT CGAAGGACAG
CTCAACTATG TCCTTGAACA CTGGGCTGGC TTCCTCGACC GCTCCTTCCT CTACCGCCTG
CTCAGCAGTC TGGACGTCAT CAAGGAAGAG GAGCGGGCGC GCTTTGGCGT CGGCGGCGGC
GATCTGACCG GCGCCTATGT GCCGGTCGCC GATTATCGCT ATCTGGAGGC GGAACCCGAA
CGCTACACGC CAGACCGCGA GTGGATGCCG CGCCTGGTGC TGCTTGCCAA GAACACGTTC
GTCTGGCTCG ATCAACTCAG CAAATGGTAT GGTCGCCCGA TCAAAACCCT CGACCAGGTG
CCGGACGAGG AACTCGACAC GATGGCGCGC CGTGGATTCA CCGGTCTGTG GCTGATCGGT
CTGTGGGAAC GCAGTGAGGC GTCGAAACGC ATCAAGCAGA TCATGGGCAA CCCGGATGCG
GTTGCGTCAG CGTATTCGCT CTACGACTAT CAGATCGCTG CGGCGCTCGG CGGGCAACCC
GCGCTCGACA ATCTGCGCGC GCGCGCCTGG CAGCGCGGCA TCCGCCTCTC CGCCGATATG
GTGCCGAACC ACGTCGGCAT CGATGGGCGC TGGGTGATCG AGCATCCCGA CTGGTTCATT
CAACTCCCCT ATCCACCCTT CCCGACCTAT ACCTTCAACG GTCCTGACCT GTCGAGCGAT
GAGCGGGTGG GCATCTTTAT CGAAGACCAC TACTACGACC GCACCGATGC TGCTGTCGTC
TTCAAGCGCC TTGATCGCTG GACCGGCGAG GCGCGCTATA TCTATCACGG CAATGATGGC
ACCAGCATGC CGTGGAACGA CACTGCCCAA CTCAACTACC TCAAACCCGA GGTGCGTGAG
GCGGTCATTC AGACAATCTT GCACGTGGCG CGGCTCTTCC CGATCATCCG CTTCGATGCG
GCAATGACGC TGGCGAAACG CCACTACCAT CGCCTCTGGT TCCCCGAACC GGGAACGGGT
GGCGATATTC CATCACGCGC AGGCTTCGGC ATGACCCGCG CGCAGTTCGA TGCCGCAATG
CCGGAAGAGT TCTGGCGCGA GGTCGTCGAC CGCTGTGCCG TCGAAGCGCC CGATACGCTG
CTGCTGGCGG AAGCCTTCTG GCTGATGGAA GGGTACTTCG TGCGCACCCT GGGTATGCAC
CGCGTGTATA ACAGCGCCTT CATGGTCTGC CTGCGCGATG AAGAAAATGC CAAGTACCGC
CAGATTATGA AGAACGTCCT TGAGTTCGAC CCGGAAGTGC TGCGCCGATT TGTCAACTTC
ATGAACAACC CCGACGAGCG CACGGCAGTC GATCAGTTTG GCAAGGGCGA CAAGTACTTC
GGGGTCTGCA CTATGCTGGT CACAATGCCC GGGTTGCCGA TGTTCGGGCA CGGACAGATC
GAAGGGTTCG CCGAGAAGTA TGGCATGGAG TACTACCGCG CCTACTGGGA CGAGAAGCCC
GATGAGTGGC TGATTGCGCG CCACGAGCGT GAAATCTTCC CACTGCTCCA CCGGCGCTAC
TTGTTTGCCG GCACGGACAA CTTTCTGCTC TACGATTTTT ATATGCCCGA CGGTCATGTG
AACGAAGATG TGTTCGCCTA CTCAAATCGT CACGGCGATG AGCGATCACT GGTGATCTAC
CACAACCGCT ATGCCCACAC AAGCGGTTGG GTGCGCCTCT CGGCGGCATT CATGGCGCGC
ACCGGCAGAG GCGACGAACG CGCGCTCGTG CAACGCGCGC TTGCCGAGGG GCTGGCGCTT
CGTGCTGGCG ACCACAACTA CACCATCTTC CGCGACCATC GCAGCGGTCT GGAGTATATC
CGCCACAGCC GCGATCTTGC GGAGCGCGGG CTGTACGTGG AACTTGGTCC CTATGATTGT
CACGTGTTCA TCGATTTCCG CGAGGTTACT GAGCGTCCCG ATGGACGCTA CGGCCAGATT
ACCGCCTACC TGGGCGGGCG CGGCGTGCCG AGCATCGACA TCGCGCTGCG CGAGATGTTC
CTCCAACCGG TGCTGATCCC CTTCCGTTCC CTGGTCAGCG CCACAATGCT GCGCGAACTG
GCTGCCTGGG CGGATCGACG CGCCATGCAG CGCGGCGCTT TGCCGGATCG TGATGGCGAA
GAAGAAGTCA TCGACAGCAT GCTCGAAGGA CCTGACGCAA TGGACGAGCC GACGCCGGAG
GAACTGGAGG CGCTGGCACA GCGTCACGCC AATATCAAGA GCGCCATTCC GTCAACCGGC
GCGCAAACCG TCACCGAACC GGAAGCGGAC GAACTCCCGG CGATTGAACA GTTCGAGCAA
CAGTTGCGCA TCTTCCTGAC CGAGTTCGCC ACGTTCAGCG GCGGAACCGC CAATACCGAA
CGCATCATTG TCGATATTCG CCGCCGTCTT GAAACCGTCA TCGAACTGCT GGTGTGCGCC
GATGCCCACC ACGCGCAACT GATCGGCGCG CCGCTGGCGG ACGACCCGGC GCGCTGGGGG
ACCGCCATTG GCTGGGCAGT CTTGCACCGC CTGGGGCTGG TGTTCAGCGA CGATGAAGCG
GCAAGCCGGA GCCGCAGCGT GATCGACGAA TATTTGCTCG GACGCGCGCT TCAGGCGGCG
CTGGTGGAGT TTGGCTACAG CGATGATGTG GCTGCCGATG CGGTGCAGTT GGCCCGGGCG
CTGACGGCGC ATCAGAACTG GCACGAGGAA TACGCCAACG ATGGGCGCGC GCTGGCAGCG
GCATTGATGG CAGACGGCGA TGTGCAGGCA TTTACACGGG TCAATCGTTT CCAGGGGGTT
CTGTGGTTCA ATAAAGAACG CTTCGAGCGT CTCCTGCACT GGCTAAACTT GATCGCCGCA
GTTGCTCTTC AGGTCGCCGA ACCCCACGAC GCCGATACGA TCCGCGCTGC CTGTGCGCGC
CTGATTGCGT CGCTGACGGA AGCCGCCGAA CAGAGCGGCT ACCGCGTCGA ACAACTACTG
GCGGCGCCAC AAAGTAAGGA GCCTTTGACC TTATAG
 
Protein sequence
MRWESNVEIS RITGAPVMEF HIARSARDRY HFDESLFQFS GNVIFANFHA ARVFAQKMNQ 
KRDLARFPEQ AVRAGQISAM GLIDEILHYV AATYRKRTRE HMLADALAWL NERLGAEKVE
TTLRAFCDAF PPLAVYRREQ ALDAYLADET GGMPHREVAL EEMLMLWLEN VNPAFNPFLE
LFDDAHLEHD TVYLKMIEEL DRFFATQPPP AAVLPGITQR TLIELLRAPA LASPHSLEGQ
LNYVLEHWAG FLDRSFLYRL LSSLDVIKEE ERARFGVGGG DLTGAYVPVA DYRYLEAEPE
RYTPDREWMP RLVLLAKNTF VWLDQLSKWY GRPIKTLDQV PDEELDTMAR RGFTGLWLIG
LWERSEASKR IKQIMGNPDA VASAYSLYDY QIAAALGGQP ALDNLRARAW QRGIRLSADM
VPNHVGIDGR WVIEHPDWFI QLPYPPFPTY TFNGPDLSSD ERVGIFIEDH YYDRTDAAVV
FKRLDRWTGE ARYIYHGNDG TSMPWNDTAQ LNYLKPEVRE AVIQTILHVA RLFPIIRFDA
AMTLAKRHYH RLWFPEPGTG GDIPSRAGFG MTRAQFDAAM PEEFWREVVD RCAVEAPDTL
LLAEAFWLME GYFVRTLGMH RVYNSAFMVC LRDEENAKYR QIMKNVLEFD PEVLRRFVNF
MNNPDERTAV DQFGKGDKYF GVCTMLVTMP GLPMFGHGQI EGFAEKYGME YYRAYWDEKP
DEWLIARHER EIFPLLHRRY LFAGTDNFLL YDFYMPDGHV NEDVFAYSNR HGDERSLVIY
HNRYAHTSGW VRLSAAFMAR TGRGDERALV QRALAEGLAL RAGDHNYTIF RDHRSGLEYI
RHSRDLAERG LYVELGPYDC HVFIDFREVT ERPDGRYGQI TAYLGGRGVP SIDIALREMF
LQPVLIPFRS LVSATMLREL AAWADRRAMQ RGALPDRDGE EEVIDSMLEG PDAMDEPTPE
ELEALAQRHA NIKSAIPSTG AQTVTEPEAD ELPAIEQFEQ QLRIFLTEFA TFSGGTANTE
RIIVDIRRRL ETVIELLVCA DAHHAQLIGA PLADDPARWG TAIGWAVLHR LGLVFSDDEA
ASRSRSVIDE YLLGRALQAA LVEFGYSDDV AADAVQLARA LTAHQNWHEE YANDGRALAA
ALMADGDVQA FTRVNRFQGV LWFNKERFER LLHWLNLIAA VALQVAEPHD ADTIRAACAR
LIASLTEAAE QSGYRVEQLL AAPQSKEPLT L