Gene Rcas_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3152 
Symbol 
ID5540650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4088059 
End bp4090971 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content63% 
IMG OID640895273 
Productextracellular solute-binding protein 
Protein accessionYP_001433224 
Protein GI156743095 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGC CAGACCGCGC ATGTCCCTCA CTCTGGTTCT CCTGGCGATG CACCGCTCGT 
ATCTTTTCGT CAGCGCTCGA CCAACTCTTG AACAGGCGTG GCTTCCGGCA ATTGTGGAAA
GGCAGGCATA TGCAGCGTGT ATGGATACTT CTGACCTTCT GGGCGGTGCT GCTGGCATCG
TGCGGCGCGC CGCCACCATC GGGTAATGCG ACTCCGGCAG TCCCCGGAGT AACGCCGACG
AGCGGGAGCG AGACGGTCAC GATCTCGTTT GCGGTGTGGG AGTACGAGCG CAACATCTAT
CAACCGCTGG CGGATCGCTT TATGACTGAG AACCCCAACA TCAAAATTGT GCTGGTGTCG
CTCGATGACA TCATGATGTT CGAGCCGGGC AACGAGGGAC CGACCAATCC GCTCGATGTT
CTACGGCGGA TCGTGTCAGC CGCCGATACT GCTCCCGCAT TCGCGCTGAC GCCGGAAGCG
TTTGGCACAC CGCTGCTGCT CGACCTCAAA CCGTTGATGG ACGCGGACGC TGCGTTTCAG
CGCGACGACT TCTTCCCCGG CGCCATCGAG CGCTACGCGG CGAAGGGCGG GGTCTGGGCG
CTACCGCGCT ACCATGGCGT GCCGCTGATC GTCTACAATC GCGCCTTGTT TCAGAATGCC
AATCTATCCG AACCACGCCC TGGCTGGACG TGGGATGATC TGATGTCCGC CGCCGAGCGT
CTCACCGAAC GCAGCGGTCT GACGACCCTC ACCTACGGGT TTATGGAACC AACCAATGGT
CTGCTGCCGC TCCTGGGGTT GCTCGAAAAA CAGGGGATCA ATCCACTAAC GGTCGTCGCG
GAAGAGACGG ATCTCACTGC GCCGGAGTAT ATTGCAGCAG TCGAGCGCAT CCAGGAATGG
TATCGCGCGG GTGTGCTGGT CGCACCCTAT GCGCGCGACG CCGCTTCTGA CGATCCCACG
CGCCTGGCGC GCGAAGGGCG GGTCGCACTC TGGAGCGACA TGTCCTACAT CAGCAACGAC
GATGGCTCGC CCTGGACGCC GGATTTTCCG GTCGGTCGTG CGCCATTCCC GAAGAATGCC
ATTCTCGATC CCTTCTTCAG CTCAACCGAA GGGTTCATCA TCAGCGGCGG CACAGCGCAC
CCCGACGCCG CCTGGCGCTG GATCGAGTGG CTCTCGCGCC AACCGGCGCT CGAGGACCAG
CAAAGTTTCG GTCCATTGTC GCGTTTGCCT GCCCGCCAAT CGGTGGCGCA GCAGACGGAG
TTCTGGAGCA AACTCGATCC GCCGACTGCT GAGGCGTACC GGTGGGCGAT AGAGAACAGC
GGGACGCTGC CAAGCCAGCA GTTCGATTAT ACCGTGATGG GGGCGTTGAG TCAGGCAGTT
GCGACGGTCA CCGGCGACCC AAAGGCGAAT GTGCGCCAGG CTCTGGCAGA AGCGCAGCGG
CAGATGCAGG AAGCGATTGC GCAGCGGGAA CTCACGCCAA CGCCGAAGCC AAACCTCAAT
CCGGTGGTAG TGGCTACGCC GGAGCCGCAG GAAGCGCCGG CCGGCGCCAC AACCGTGACG
TTTGGCCTTT ACGGGTACAA TCCGACCGAA CTGCGCCGCG TCGCGCGCGC GTTTCGCGAA
CAGCGTCCCG ACATCTTCGT GCAGTTGAAG CCGTTCGAGT GGACGCCCGA CTTGAAGGAG
GTCACGGCTG CCACCCTGGC GCAGACCAGC GACTGTTTTT TCTGGTTCGG TCCGCTCGCC
GGCAACGATG AAACGGCTCT GCTCGATCTG CGCCCATTGA TCGAGGCAGA CTCCACCTTC
CCGCGCGATG ACATCGTCCC GGCGGCGCTG GCACAGTACA GCCGCGAGGG GCGCATCTAC
GGGTTGCCCT ACGCGGTCAA TATGCGCAGC CTGATTTACA ACCATGCCGC ATTCGAGGCG
GCGGGAGTGC AACCGCCGCG CGCCGACTGG AAACCGGACG ATTTTCTGGC GGCGGCGCAG
GCGTTGACCA GAGGCGAAGG GAACGAGCAG CGGTGGGGGT ATGTGCCGCT GGGAGGTCCG
CAAGCCGACC TGTTGTTCTT CATCAACCAA TTTGGCGCGC GCCTGATGGT TGGCGAGGGC
AAGGACCTGC GCCCGAACTA CACCGATCCG AAGACTATCG CTGCCATCCG CTGGTACCTC
GATTTGAGCA TGGTTCACAA GGTGGCGCCG CCGCTCACCA TCGTCTATCG GCGCGACGAT
TTTGGCGGCG GGGATCGTTC CTACGAACTG GTCCAATCCG GTCGGGTCGG GATGTGGTTT
GGGTATCCTG AATCATTCCA GGGAGGGGTC GTGATTCAAC CGCTCGAACC GGGTGGTGGA
CCGGCGCCGA CTGCCGCGCC GCTGACCCCG GTCGAACGCG ACATTCGAGC TGCGCCACTG
CCGGTCGGCG GCGCCGGTGT GCCGTCGAGC GAAGTGTTTT TGCGGGGCCT GTTCATCTCG
GCGCGGACGC AGCAACCGCA GGCATGCTGG GAATGGATCA AGTTCCTTTC CGGCGACACG
TCCCTGACGT ATGGCGATAT GCCCGCCCGC CGCTCGGTCG CACAATCTGA GGCATTCATC
AAACAGTTGC CGCCGGAGCG AGTCGCGCAG TTCGAGGCGA TGCGGGCAAC CCTGGCGATG
CCGTCGCAGA ACGCCGGTGA TGCTAACGTG ATCTACAGCC AGCACACCGA TCCCTACTGG
TTCTTTAAGG CGCTTAGCGC CGTGGTGGAA AAGGGCGCAA GCCTCGAAAA GGAACTTGCC
GAAGCGCAGC GCTTTGCAAC CGCCTTTGCC GAGTGTATGA ACCGTGAGGA TGCGCGTGCG
CCGGCATGCG CAAAAGAAGT CGATCCAGAC TATAAAGGGT ACAATGTCGA GGAGGGCGAT
CCGACGCCTC TTCCCGCCGG CGCTGCGCCG TAG
 
Protein sequence
MTLPDRACPS LWFSWRCTAR IFSSALDQLL NRRGFRQLWK GRHMQRVWIL LTFWAVLLAS 
CGAPPPSGNA TPAVPGVTPT SGSETVTISF AVWEYERNIY QPLADRFMTE NPNIKIVLVS
LDDIMMFEPG NEGPTNPLDV LRRIVSAADT APAFALTPEA FGTPLLLDLK PLMDADAAFQ
RDDFFPGAIE RYAAKGGVWA LPRYHGVPLI VYNRALFQNA NLSEPRPGWT WDDLMSAAER
LTERSGLTTL TYGFMEPTNG LLPLLGLLEK QGINPLTVVA EETDLTAPEY IAAVERIQEW
YRAGVLVAPY ARDAASDDPT RLAREGRVAL WSDMSYISND DGSPWTPDFP VGRAPFPKNA
ILDPFFSSTE GFIISGGTAH PDAAWRWIEW LSRQPALEDQ QSFGPLSRLP ARQSVAQQTE
FWSKLDPPTA EAYRWAIENS GTLPSQQFDY TVMGALSQAV ATVTGDPKAN VRQALAEAQR
QMQEAIAQRE LTPTPKPNLN PVVVATPEPQ EAPAGATTVT FGLYGYNPTE LRRVARAFRE
QRPDIFVQLK PFEWTPDLKE VTAATLAQTS DCFFWFGPLA GNDETALLDL RPLIEADSTF
PRDDIVPAAL AQYSREGRIY GLPYAVNMRS LIYNHAAFEA AGVQPPRADW KPDDFLAAAQ
ALTRGEGNEQ RWGYVPLGGP QADLLFFINQ FGARLMVGEG KDLRPNYTDP KTIAAIRWYL
DLSMVHKVAP PLTIVYRRDD FGGGDRSYEL VQSGRVGMWF GYPESFQGGV VIQPLEPGGG
PAPTAAPLTP VERDIRAAPL PVGGAGVPSS EVFLRGLFIS ARTQQPQACW EWIKFLSGDT
SLTYGDMPAR RSVAQSEAFI KQLPPERVAQ FEAMRATLAM PSQNAGDANV IYSQHTDPYW
FFKALSAVVE KGASLEKELA EAQRFATAFA ECMNREDARA PACAKEVDPD YKGYNVEEGD
PTPLPAGAAP