Gene Rcas_0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0163 
Symbol 
ID5537624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp198519 
End bp199694 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content58% 
IMG OID640892327 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001430315 
Protein GI156740186 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCGAC ATTTGTTGTG GATGATGCTG TTTCTGACGA TTGCCGCGAC GGCATTGCCT 
GGAGGGAGCG CCACATATGC CGCGCCCGGC TACGAAATCC AGGTGGTGAA GACCGGTCTC
GATCGACCGT GGAGTATCAA TTTTGCGCCT GATGGGCGGC TATTCTTTAC TGCGCGCAAC
AGTGGTCGCC TGTATGCCCT GAATACCGCA ACCGGCAATG TACAGACGTT CAGTGGTCTG
CCGCCTGCCC GATTTCGCGC CGAGCAAGAA GCCGGCATGA TGGGAATGGC GCTCGACCCC
GATTTTGCGA CAAACGGCTG GGCGTATATC TGCTACAGCT TCTTCGATAA CGACGGCAAT
CGTCGGAACC GCCTCTCGCG GTTCACGGTC AATCCGGTAT CTGGCGCCGT TTCTGGAGAG
CGCGTCCTGA TTGAGACGAT GGTTGGCGCA CTCTACCACA ATGGCTGTCG CGTGATCGTC
TCGCCTGATA ACCGGTATCT GTTCGTATCG ATGGGCGATG CCACCGTTCC ATCACTCGCG
CAGGATCTCG ACAGCACCGG CGGCAAGACC TTCCGCATCT TCAAAGACGG CAGCATTCCA
ACTGATAACC CCTTTTACGA CAACGGTCGG ATACCGCGTT CACTGATCTG GACGTATGGG
CACCGCAACC ATCAGGGGCT GGCATTCCAC CCGACGACCG GCGACCTCTG GAGCACCGAG
CACGGACCGG AGATCATGGA CGAACTCAAC GTCCTGATCG CCGGGCGCAA CTACGGCTGG
GGTTGGGGGA GCGGACCGCA TTATTGCCTG GGAACGGTCA ACTGCGGCAG TGTGCCCGAT
TTCATGCCGC CGGTTGCAGT GTTTAACCCT GAAAGAACGG TTGCCACGTC CGACATGGTT
TTCTACACAG GCAGCGCATT CCCTGAATGG TCTGGCGATC TCTTCTTCGT CACGTTGAAA
ACCGGCAGGC TCTACCGCCT GAAGATCGAC AATCGCACGA TTGTCGAGCA GGAGATTCTG
ATCGATGGTA CGTATGGTCG CCTGCGCGAT GTGACCGTTG GACCGGATGG GTTTCTGTAT
ATATCCACCG ACGAGACAAG TGCGCAATTG CTGCGCATCC GTCCGACCAT CGAGCGCCCT
TATCGCGTCC AACTGCCACT CGTTATGCGG GGGTAG
 
Protein sequence
MPRHLLWMML FLTIAATALP GGSATYAAPG YEIQVVKTGL DRPWSINFAP DGRLFFTARN 
SGRLYALNTA TGNVQTFSGL PPARFRAEQE AGMMGMALDP DFATNGWAYI CYSFFDNDGN
RRNRLSRFTV NPVSGAVSGE RVLIETMVGA LYHNGCRVIV SPDNRYLFVS MGDATVPSLA
QDLDSTGGKT FRIFKDGSIP TDNPFYDNGR IPRSLIWTYG HRNHQGLAFH PTTGDLWSTE
HGPEIMDELN VLIAGRNYGW GWGSGPHYCL GTVNCGSVPD FMPPVAVFNP ERTVATSDMV
FYTGSAFPEW SGDLFFVTLK TGRLYRLKID NRTIVEQEIL IDGTYGRLRD VTVGPDGFLY
ISTDETSAQL LRIRPTIERP YRVQLPLVMR G