Gene RoseRS_3310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3310 
Symbol 
ID5210285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4158451 
End bp4160178 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content61% 
IMG OID640596906 
Productalpha amylase, catalytic region 
Protein accessionYP_001277621 
Protein GI148657416 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCGC TCACATGGTG GCAGACGGCA GTGTTCTACC AGATCTATCC GCGCAGTTTC 
GCCGACGGTA ATGGCGACGG CATCGGCGAC TTTGCGGGCA TGATCGACAG ACTCGACTAC
CTGCGCGATC TGGGGGTCGG GGCGCTCTGG CTCTCGCCGC ACTATCCTTC CCCGAATGCG
GACTGTGGCT ACGACATCTC GGACTATACC GGCGTCGCGC CTGAATACGG AACGCTTGAT
GATTTCCGGC GTTTTCTCGA CGGCGCCCAC GCGCGCGGTA TGCGGGTGCT GCTCGATCTG
GTGCTCAACC ATACGTCCGT GGAGCATCCC TGGTTCAGGG AGTCGCGGTC GAGCCGGGAT
AACCCGAAGC GCGACTGGTA TATCTGGCGC GACCCTGCAC CTGACGGCGG TCCGCCGAAT
AACTGGTATT CGGCGTTCGG CGGTTCTGCC TGGACATTCG ATGAGACGAC CGGACAGTAC
TACTACCACT TTTTCTTCAA GGAACAACCC GACCTGAACT GGCGCAACCC GGATGTGAAG
CGGGCGATGT GGCAGGCGAT TCGTTTCTGG CTCGATATGG GAGTGGACGG TTTTCGTCTC
GATGCGATCG ACACCATCTT TGAGGACCCC GCGCTCACCC CGCACGAATC GCGGTTGTCG
CAGGTTGAGA TGCTGCGCAT CTGGCGCGAA AACCGTCCGC CGGAAGAGAC GAAAGAACTC
TGGGAGCAGT TTGCGCTGAT GTTTCGGTAT CAGGTGCAGC AACCAGGGTT GCACGAGTTG
ATGAAAGAAT TGCGCGCATT GGTGGACGAA TATCCAGGAA ATCGGGTGCT GATCGGCGAA
GGGGACGACA TTGCATACTA CGGCAACGGC AGTGATGAAC TGCACCTGGT GTTCAATTTT
CCGCTTATGC GCACCAATCG GTTGACGCCA GCATGGGTGC GTGCCAATCA GGCGGAACGT
CTGGCAGCGT TGCCCCCCGG CGCCTGGCCC TGCAACACAT TGGGGAATCA CGATGTCGGG
CGCATGTGGA CGTCATACGG CGATGGGGTG AACGATGCGG CGCTTGCCCG TCTGCACGCG
GCGATGCTGC TGACGCTGAA GGGCACGCCG GTGCTCTACA ACGGCGAAGA GATCGGCATG
ACCGATCTGT TGCTCGAACG GTTTGAACAG TTGCGCGACA ATCAGGCGGT CAATCTGTAT
CACCTGGCGG TCGGCGATGG CATCGATCCC GCTGAGGCAA TGAAGATGGC AGCAGCGATC
AGCCGTGACC GCTGTCGCAC GCCGTTCCAG TGGGCGAATG CGCCGAATGC TGGATTCAGT
CCGCCGGGCG TGGCAACCTG GTTGCCGGTC AACCCCAACT ACGCGCAGGG CGTGAATGTT
GCCGATCAGG AACAGAACCC GGATTCGCTG CTCAACTACT ACCGCCGCCT GATCGGTGCG
CGCCAGGCGA TACCGGCATT GCTGGCGGGC GACTATGCGC CGCTCCATCC TGACGAGGAT
CGCTATCTGG CGTTTCTACG CACAACGCCG GATCAGCGCT GCCTGGTTGT GCTCAACTTC
TCGCCGGAGC CGGTCACAAC CGGCTTCGAT CTGAACGGCG CCCGTTTGCG CACACTCTTT
TCAAGCCACC CCCGCCCGAC CCGCGACGAA CATCCGGAGC GCCTGACCCT GGCGCCGTTC
GAGGCATACA TCGGCGAGGT GATACGGATT GGGACAGATT GGGAATAG
 
Protein sequence
MQSLTWWQTA VFYQIYPRSF ADGNGDGIGD FAGMIDRLDY LRDLGVGALW LSPHYPSPNA 
DCGYDISDYT GVAPEYGTLD DFRRFLDGAH ARGMRVLLDL VLNHTSVEHP WFRESRSSRD
NPKRDWYIWR DPAPDGGPPN NWYSAFGGSA WTFDETTGQY YYHFFFKEQP DLNWRNPDVK
RAMWQAIRFW LDMGVDGFRL DAIDTIFEDP ALTPHESRLS QVEMLRIWRE NRPPEETKEL
WEQFALMFRY QVQQPGLHEL MKELRALVDE YPGNRVLIGE GDDIAYYGNG SDELHLVFNF
PLMRTNRLTP AWVRANQAER LAALPPGAWP CNTLGNHDVG RMWTSYGDGV NDAALARLHA
AMLLTLKGTP VLYNGEEIGM TDLLLERFEQ LRDNQAVNLY HLAVGDGIDP AEAMKMAAAI
SRDRCRTPFQ WANAPNAGFS PPGVATWLPV NPNYAQGVNV ADQEQNPDSL LNYYRRLIGA
RQAIPALLAG DYAPLHPDED RYLAFLRTTP DQRCLVVLNF SPEPVTTGFD LNGARLRTLF
SSHPRPTRDE HPERLTLAPF EAYIGEVIRI GTDWE