Gene RoseRS_1360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1360 
Symbol 
ID5208312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1668880 
End bp1670904 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content60% 
IMG OID640594971 
Productglycoside hydrolase, clan GH-D 
Protein accessionYP_001275710 
Protein GI148655505 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCG TTGGATCGGG TGACTCGTCT GAGACCTGGT CGATTGCGAC CGATCAGGTC 
GATCTGGTTG TGTCGCTGGC GCACGGGCGC CTGACCTGGA CAATCGGCGG CGCAACTGGA
GGACTCGACC TGTCATCAGC GACCGGAACC TTCCTGCGCG TCAATGGTCA GACGCCGGTG
TGGACGCAGA TCGTCAATGT CGCTGAGCAC TCCGCCGATG ATGGCGGTCG GCTGCTGACC
CTGACGCTCG ATACGGCGGA TGGCAGAATG CGCTTGACTC GCCATTTCCA TCTGTTCGCC
GATCATCCAT TTGTGCGCAC CTGGGCGACG CTGGAAAACC GGAGCAGCGC CGTCCTGCTG
ATCGATCAGT GCGACATCCT GACGCTTGCA CCGCATGGCG AGCCGCCGCT GTATCTGTTT
CACGTCGAGC AGTTCAGCTG GAACTATCGG CGCGACTTTT TCAGCCAGCA TGAAGTATGG
TTGCGACCGG GGTGCGCTCC CCACGAGATC CGCATGGGTT CGTACCCTGC CCACCACTGG
GGTCCGTCCA GTTGCGCATG GTTCGCGCTA CGCGACGGTT TGCCCGATTG GAATGAAGAT
CCGCCGAAGC GTGGGCGCGG CATCGTGTGC GGCATCGAGT TCAACGGCAA GAGCCGATTG
CGCGCCTGGG CAACGACGGA ACAGGTGAGT CTTGTGAGCC AGATCGATGA TCTGGCGCAC
CGACTCGCGC CAGGTGCGAT CTTCGAGATA CCGGCGTATT TCGTCGGGCG CTTCGAGGGC
GATTGGGACG AGGCGGGGTA TGTGACGCAG CGCTTTGCCG GGGCGCACGT CCATCCCCCG
ATGCCCGATG ATCGCTACCC ATGGGTACAG TACAACTCCT GGCGGTATGA GCAGAACATC
AATGAAGAGC AGCAACTGGC GGCAATCGAC CGCTGCGCTG AATTGGGCAT CGAGCTGGCA
GTGATGGACC TCGGATGGGC GCGGATGATT GGCGACTGGC GCCCTGATCC GGTCAAGTTT
CCGCGCGGGT TGCGTCCGCT TGTTGAGCGT GCACATGCGT ATGGTATGCG GTTCGGCGTT
CACGTTGCAC TGGCGCAGTG CAATCCCGAT GCGCCGGTTG CCAGAGAGCA TCCTGACTGG
CTCATCCATA CCGGGAATGA TTACTACGGC GCAGGTCCGC TCTGCCTGGG GCACGAGCCG
TGTCGCGCCT GGCTCATCGA GCAACTCATA CGATTGATCG ACGAAGAGGG GATCGATTAC
ATCATCCAGG ATGGAGAGGA CATGGTGAAA CGGTGCGAGC GGAGCGATCA TACGCATGCG
CCGGGTGACA GCAACTATGC CAACTCCCAG TATGGGCTGG ATATCGTGAT CGAGTCGCTC
CGCCGCGCCC GTCCACATCT GGTGCTCGAA AATTGTGAGG ATGGCGGTTG TATGATGACG
TACAAAATGG CGCGACTGTA CCATACGAGC ATCACGGTGG ACAACACATC GTCCTACGCG
ACGCGACAGG GAGTCTATGG CGCTTCCTAT CCATTTTCGC CGCGCTACAG CGTGCGCTAT
ATGCAGGACG ATCCTTCGCG CTACACGCTG CGCAGTTCGA TCTTCGGCGG ACCACTCATC
CTGATGCAGC GCGTCACGGA ATGGAACGAA GCGCAGATGG CAGAAACCAG GCAGGCGATT
GAGCAGTACA AGGCGTTGCG CCATCTGATC CGTTCGGCGA AGATCATCCA CCTGAAAGCG
CCGCAGCACA ACATCGACGG TCTGGGGTGG GGATGGGACG CCATTCAGGC AGTCGCGCCG
GATACATCGC GCAGCGTCAT AATGGTCTAC CGCGCACGGG GCGATCTGGC TGAACGCACA
TTCAGGCCGC GCGGCTTACT CCCAGAGGCG CACTATGCGG TTCGGTTCGT TGATAGCGGA
CACACCCTCC AGTGCGCCGG CGCCGAACTG GAGCGCGATG GGATAACCAT AACGCTCGAT
GAATTCGACT CGGAGATTGT GATGCTGGAG AGGGTTGAAG GTTGA
 
Protein sequence
MTRVGSGDSS ETWSIATDQV DLVVSLAHGR LTWTIGGATG GLDLSSATGT FLRVNGQTPV 
WTQIVNVAEH SADDGGRLLT LTLDTADGRM RLTRHFHLFA DHPFVRTWAT LENRSSAVLL
IDQCDILTLA PHGEPPLYLF HVEQFSWNYR RDFFSQHEVW LRPGCAPHEI RMGSYPAHHW
GPSSCAWFAL RDGLPDWNED PPKRGRGIVC GIEFNGKSRL RAWATTEQVS LVSQIDDLAH
RLAPGAIFEI PAYFVGRFEG DWDEAGYVTQ RFAGAHVHPP MPDDRYPWVQ YNSWRYEQNI
NEEQQLAAID RCAELGIELA VMDLGWARMI GDWRPDPVKF PRGLRPLVER AHAYGMRFGV
HVALAQCNPD APVAREHPDW LIHTGNDYYG AGPLCLGHEP CRAWLIEQLI RLIDEEGIDY
IIQDGEDMVK RCERSDHTHA PGDSNYANSQ YGLDIVIESL RRARPHLVLE NCEDGGCMMT
YKMARLYHTS ITVDNTSSYA TRQGVYGASY PFSPRYSVRY MQDDPSRYTL RSSIFGGPLI
LMQRVTEWNE AQMAETRQAI EQYKALRHLI RSAKIIHLKA PQHNIDGLGW GWDAIQAVAP
DTSRSVIMVY RARGDLAERT FRPRGLLPEA HYAVRFVDSG HTLQCAGAEL ERDGITITLD
EFDSEIVMLE RVEG