Gene RoseRS_1356 details


Gene Information

Locus tag: RoseRS_1356
Symbol:
ID: 5208308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1663500
End bp: 1665551
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 63%
IMG OID: 640594967
Product: glycoside hydrolase, clan GH-D
Protein accession: YP_001275706
Protein GI: 148655501
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
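
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1663500
end_bp = 1665551

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the last codon is assumed to be the stop codon,
# which does not add an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2052 0 683
```

The result matches the listed gene length (2052 bp) and protein length (683 aa), and the zero remainder confirms that the gene spans a whole number of codons.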


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTTC CGTCCCGCCT GGATCTCTCG GACGCGGGGT TCGCCTGCGT GCTGGACATG 
GCACCCCGTT TTGCGCTGGC AGGTCTGGGG CTGGCGGGCG AACATCGGGC GCTTGAGTCG
TCGTCGCTCT TCCTGGCAAT CATTGATGCG CAGGTTGTCG ATGCGCAGAC GCCAAACCTG
TACGTGCGGA ATGTGTCGGT GGACGATGCA ATCCCTGGAC GACGCCATGC GCGCATCGAA
CTGCTGCCTG GCGGGTTGGG AGTCGTGATC GATTACCATA TCGTGCGCTA CAGCGATGCG
TTTGCTATCG AGACCTGGAT CGTCGTGCGG AATGAAGGGG CATTGCCGCG TCGGGTGACG
CGCCTGGATT CACTTGCGCT CGATCTGCTC CCCGGCAGGT ACGATCTGCA GGCGTATACT
GGCGCGTGGG GCGCCGAGTT CGAACCGCAG TCGATGCCGC TGACATCCCC GGTCACCCTC
GAAAGTCGTT CTGGTCGCTC GTCGCACGGT CACCATCCCT GGTTTGCCCT GGTGTGCGAC
GGTCGCTCGA TCATCTCCGG CGCAGTTGCC TGGTCAGGGA ATTGGGCGAT CCGGTTGATG
CCGCGCCTGG AGGAGGTCGT CGCACTATCG GCAGGGTTGC ACGATTGGGA ATTCGCTGTC
GATCTTGCGC CCGGCGCCTC GGTTGAGGCG CCGCCGGTCG TGCTGGTGTT TGCCCGTGGT
GATGATCTCG ACGAAGCAGC GGTTCAGTTT GCGCGTCTGG GGCGACGCTT CTGGTATCCG
CGCAATGCGC TCGCCGACCG GTTGCCGGTC GAGTGGAACC ACTGGTGGGC GTATGAAGAT
CGGGCGCTCG ACGAGGCGAC CTTCCGCGCC AATGTCGATG TTGCCGCCCG GATGGGCATC
GAGGTGTGCA CGCTCGACGC CGGATGGTTT GGCGCGTCCG ATGCTGGGAC GCACTGGTAC
GACCAGCGCG GCGACTGGGA GATGGTCAAT GCGGTGCGAT TTCCTTCCGG CATTCGCGCG
CTTGCCGATG ACGTTCACGC ACGTGGCATG CGCTTCGGCA TCTGGTGCGA GATCGAGGGG
TTGGGCGTCC GCGCGCGACT GGCGGAAACG CATCCTGATT TCGTGGCGAT GCGTCATGGA
AGTCGGATCG GGTATGTGTG CCTGGGCAAT CCGGCAGCGC AGCAATGGGC GTTCGAGACG
CTTGATCATC TCATCCGCGA CTACGGGTGC GACTGGATCA AACTCGATTT CAATCTCGAC
CCTGGCGCCG GGTGCAACCG CACCGATCAC GGTCACGGCG CCAGAGATGG GTTGTATGCG
CACTATCGCG GCTACTATGC GCTGCTTGAT CGAGTGCGCA GCGTTCATCC CGACGTGGTG
CTGGAAAACT GCTCATCGGG CGGTCTGCGG ATCGATCCGG GCATTGCGCG TCGCACCCAC
ATGGCGTTTT TGAGCGATCC CGACTGGCCC GAACATAGTT TGCAGGTCTT CTGGGGCGCC
ACTCAGATGC TGGCGCCGGA CGCCTGCCTG CACTGGAGCT ACTGCGAGTG GTCGTTCGCC
AGGCATCCGA GCCAGACGTT CAATCCGCGC GATCCGTCGC TTCAGCCGCA TCAGGTCGAT
TTCTATACCC GCATTTCAAT GCTGCGCCGC TTCGGGTTTT CGCAGCGATT GCCCGATCTG
CCGGACTGGG TTGCGCAGCG TTATGCGGAT CACATCGCCT TCTACAAGGC GGTTATGCGG
CGTTTCGTGC GTGAGGCGGA CATGTACCAT CTGACGGGGC AACCGCTCGG TGAAGGACGC
GGTGACCGCT GGGCCGGTTT TCAATATCGG ATGCCCGACG GCAGCGAGCA TCTGGTCGCC
GTCTTTCGTC TGCCGGGCGC TGAACCATTG CGTGTGCTGC GCCTGAAACA TCTCCATCCA
GAGCGTATCT ACACATTGCT CTGGGTGGAT TCCGGTCAAC AGACGCAGGC AAGCGGCGCT
GAATTGATGG ATACCGGGTT GCGCTTCGAT GACCTGCCCG AAGAAGGCTC GGCGCTGGTG
CGGATCAGGT GA
 
Protein sequence
MTLPSRLDLS DAGFACVLDM APRFALAGLG LAGEHRALES SSLFLAIIDA QVVDAQTPNL 
YVRNVSVDDA IPGRRHARIE LLPGGLGVVI DYHIVRYSDA FAIETWIVVR NEGALPRRVT
RLDSLALDLL PGRYDLQAYT GAWGAEFEPQ SMPLTSPVTL ESRSGRSSHG HHPWFALVCD
GRSIISGAVA WSGNWAIRLM PRLEEVVALS AGLHDWEFAV DLAPGASVEA PPVVLVFARG
DDLDEAAVQF ARLGRRFWYP RNALADRLPV EWNHWWAYED RALDEATFRA NVDVAARMGI
EVCTLDAGWF GASDAGTHWY DQRGDWEMVN AVRFPSGIRA LADDVHARGM RFGIWCEIEG
LGVRARLAET HPDFVAMRHG SRIGYVCLGN PAAQQWAFET LDHLIRDYGC DWIKLDFNLD
PGAGCNRTDH GHGARDGLYA HYRGYYALLD RVRSVHPDVV LENCSSGGLR IDPGIARRTH
MAFLSDPDWP EHSLQVFWGA TQMLAPDACL HWSYCEWSFA RHPSQTFNPR DPSLQPHQVD
FYTRISMLRR FGFSQRLPDL PDWVAQRYAD HIAFYKAVMR RFVREADMYH LTGQPLGEGR
GDRWAGFQYR MPDGSEHLVA VFRLPGAEPL RVLRLKHLHP ERIYTLLWVD SGQQTQASGA
ELMDTGLRFD DLPEEGSALV RIR
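
As a spot check that the protein sequence corresponds to the gene sequence, the first and last blocks of the gene can be translated by hand or with a few lines of code. The sketch below is a minimal illustration, not the tool that produced this record: the `translate` helper and the way the codon table is built are assumptions made for the example. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 12 bases of the gene sequence, as printed in the record.
first_block = "ATGACCCTTC CGTCCCGCCT GGATCTCTCG"
last_block = "CGGATCAGGT GA"

print(translate(first_block))  # MTLPSRLDLS
print(translate(last_block))   # RIR
```

Both fragments agree with the start (MTLPSRLDLS) and end (RIR) of the protein sequence. The last block can be translated on its own because every full line of the gene sequence holds 60 bases, a multiple of three, so each line begins in frame.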