Gene Rcas_4167 details

Gene Information

Locus tag: Rcas_4167
Symbol:
ID: 5541678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5390600
End bp: 5392324
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 59%
IMG OID: 640896278
Product: alpha amylase catalytic region
Protein accession: YP_001434216
Protein GI: 156744087
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
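
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5390600
end_bp = 5392324

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1725
print(protein_length)  # 574
```

Both results agree with the listed Gene Length (1725 bp) and Protein Length (574 aa).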


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.718668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00459901
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATTCGC TCAAATGGTG GCAAACGACG GTGTTCTATC AGATCTATCC GCGCAGTTTC 
GCCGATGGGA ACGGAGATGG CATCGGCGAT TTCGCCGGGA TGATCGACCG GCTCGATTAC
CTGCGCGACT TGGGAGTCGG TGCGCTCTGG CTCTCGCCAC ACTATCCTTC ACCCAATGCC
GACTGCGGCT ACGACATCTC CGATTACACC GGCGTTGCCC CAGAGTACGG CACACTGGAC
GATTTTCGAC GTTTTCTGGA AGGCGCGCAC GCACGCGGTA TGCGCGTCCT GCTCGACCTC
GTCCTCAACC ACACGTCCGA AGATCACCCC TGGTTCCGGG AGTCACGCTC CAGCCGCAAC
AATCCAAAGC GCGACTGGTA CATCTGGCGC GATCCGGCGC CCGACGGCGG ACCGCCGAAC
AACTGGTATT CGGCGTTTGG CGGTTCTGCC TGGACGTTCG ACGAAGCGAC CGGGCAGTAT
TACTATCACT TTTTCTTCAA GGAACAGCCC GACCTGAATT GGCGCAACCC GGATGTCAAA
CAGGCGATGT GGCATGCAGT GCGGTTCTGG CTCGATATGG GGGTGGATGG GTTCCGCCTC
GATGCGATTG ACACGATCTT CGAAGACCCG AACCTGACGC CGCAGCAGTC GAAATTGTCG
CAGATCGAGA TGCTGCGCAT CTGGCGCGAG AATCGTCCGC CGGAGGAGAC AAAAGAACTC
TGGGAACAGT TCGCCCTGAT GTTCCGCCAC CAGGTGCAGC AGCCAGGATT GCACGAGTTG
ATGAAAGAAC TACGCGCATT AGTGGACGAA TATCCAGGTG ATCGGGTGCT GATCGGCGAG
GGTGACGATA TTGCGTACTA CGGCAACGGC CATGATGAGT TACACTTGGT GTTCAATTTT
CCGCTCATGC GCACCAATCG GTTGACGCCT GCCTGGATCC GCGCCAATCA GGCGGAACGA
CTGGCAGCGT TGCCGCCTGG CGCATGGCCC TGCAACACGC TGGGAAACCA TGACGTTGGC
CGCATGTGGA CGGCATATGG CGACGGCGTC CACGATGCAG CGCTTGCCCG CTTGCATGTG
ACGATGCTGC TGACACTGAA AGGCACACCG GTGCTCTACA ACGGCGAGGA GATCGGCATG
ACCGATCTGC TGCTCGAACG GTTCGACCAG TTGCGTGACA ATCAGGCGGT TAACTTGTAT
GACGCAGCGG TCAACGATGG CATTCCTGCC GATGAAGCGA TGAGGATGGC GGCAAAGATC
AGCCGGGACC GCTGTCGAAC GCCGATCCAG TGGGCAAATG CGCCAAACGC CGGTTTCAGC
CCGGCCGGTG TGACGACCTG GCTGCCGGTC AATCCGAACT ATGCTCAGGG AGTAAATGTC
GCCGAACAGG TCGGCGATCC TCACTCGCTC CTCACCTTCT ATCGTCGCCT GATCGCCGCG
CGCCAGGCGA CTCCCGCACT GTTGGAAGGC GATTACACGC CACTGCATCC AAACGAGGAG
CGTTATCTGG CGTTTCTGCG CACCACACCG GAACAGCGCT GCCTTGTGGC GCTGAACTTT
ACGGCTGAAC CGGTGACGGC AAGTTTTGCA CCTGGCGAAG ATTCACTTTT ACGCACGATT
TTCTCGACCC ATCCGCGACC GGCAGGCGAA GAAAACCCGG CACACCTGAC ACTGGCGCCG
TTCGAGGCAT ACATTGGAGA GATTCTGCCA GAGCGTCAGG GATAA
 
Protein sequence
MHSLKWWQTT VFYQIYPRSF ADGNGDGIGD FAGMIDRLDY LRDLGVGALW LSPHYPSPNA 
DCGYDISDYT GVAPEYGTLD DFRRFLEGAH ARGMRVLLDL VLNHTSEDHP WFRESRSSRN
NPKRDWYIWR DPAPDGGPPN NWYSAFGGSA WTFDEATGQY YYHFFFKEQP DLNWRNPDVK
QAMWHAVRFW LDMGVDGFRL DAIDTIFEDP NLTPQQSKLS QIEMLRIWRE NRPPEETKEL
WEQFALMFRH QVQQPGLHEL MKELRALVDE YPGDRVLIGE GDDIAYYGNG HDELHLVFNF
PLMRTNRLTP AWIRANQAER LAALPPGAWP CNTLGNHDVG RMWTAYGDGV HDAALARLHV
TMLLTLKGTP VLYNGEEIGM TDLLLERFDQ LRDNQAVNLY DAAVNDGIPA DEAMRMAAKI
SRDRCRTPIQ WANAPNAGFS PAGVTTWLPV NPNYAQGVNV AEQVGDPHSL LTFYRRLIAA
RQATPALLEG DYTPLHPNEE RYLAFLRTTP EQRCLVALNF TAEPVTASFA PGEDSLLRTI
FSTHPRPAGE ENPAHLTLAP FEAYIGEILP ERQG
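
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It is not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (to my knowledge the two differ only in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last few codons are used here; the full sequence can be pasted in the same way, since spaces and line breaks are ignored.

```python
# Standard codon assignments, indexed by bases in the order T, C, A, G.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
gene_start = "ATGCATTCGC TCAAATGGTG GCAAACGACG"
gene_end = "GAGCGTCAGG GATAA"

print(translate(gene_start))  # MHSLKWWQTT
print(translate(gene_end))    # ERQG
```

The two outputs match the first ten and last four residues of the protein sequence. Calling `gc_percent` on the full gene sequence gives a value to compare with the listed GC content.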