Gene Rcas_1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1174 
Symbol 
ID5538640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1518896 
End bp1520263 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content60% 
IMG OID640893306 
Productextracellular solute-binding protein 
Protein accessionYP_001431289 
Protein GI156741160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.273661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.401546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGATGT GTCATTCTCC GCATGACCAT CTTCACAATC CGAAAGGCAG GGAAGCTATG 
TTCACGACAA AGTGGTTGAA ACTGTTCGGC GTCTTTTCGA TTCTCGCGCT CGTGCTGGCT
GCCTGTGGCG GCGCACCGGC GACAACTCAG CCCACTGCCG CCCCGGCGCA ACCAACGACC
GCACCGGCGC CGACCGGCGG CGGTAAACTC GAAATCTTCA GTTGGTGGAC AAATGGCGGC
GAAGCCGATG GGCTGAACGC CATGTTCGAC ATCTACAAGC AGCAGAATCC TGGCATCGAA
ATTGTGAATG CCACTGTCGC CGGGGGCGCT GGCACGAATG CCAAGACCGT ACTGAAGACT
CGCCTTCAGG GTGGGCAACC GCCGGATAGC TGGCAGGTGC ATGCTGGCAA GGAATTGACG
GCATATGTCG ATGCCGGTCA GATGGAACCG CTGACGCAGT TCTTCAAGGA GCAGGGCTTC
GACAAGGTCA TGCCGCCGAA ACTGCTCGAA CAGATCACCT ACAACGGTGA AATCTGGTCG
GTGCCGGTCA ATATCCACCG ATCGAACGTT CTCTGGTACA ACATCAAGAT TTTCCAGGAG
AACGGTCTGA CCCCGCCCAA GACTATCGAC GACTTCTTCA CAGTGGCTGA GGCGCTTCAG
GCGAAAGGCA TCATTCCGCT CGCAGTCGGC GGCAAAGACA AGTTCGAGAC GCCGCACCTG
TTCGAGAGCG TGCTTCTGGC GGTCTTTGGA CCGGACGATT ACGCGAAACT GTTCCAGCCC
GGCGCCGACT GGAGCGATCC GCGTGTTCGC CAGGCAGCCG AGATTGCGAA ACGAATGTTG
GAGTACTCCA ACAGCGACCG CTCGTCGCTG GGATGGGCGG ATGCCGCACA ACTCGTGCTC
GACGGCAAGG CGGCCATGAC CATTATGGGC GACTGGGCGC ATGGCTACTT CATCAGCAAG
GGCGCGAAAG TCGGTGTTGA CTATGGCTAT GCCGCAGCGC CGGGCAACGA CGGCGTCTTC
ATGTGGCTGT CGGACAGTTT TGGTCTGGCG AAGGGCGCGC CGAACCCGGA GCAGGCGAAG
GCATGGCTGG CGCTCTGCGG CTCGCGTGAG GGGCAGGACG CCTTCAACCC GAAGAAAGGG
TCGATCCCGG CGCGCACCGA TGCGAATGTG AGCCTGTACG ACGAGTATCT CCAGTACTCG
ATCAAAGCCT TCGGCAGCGA GAAACTGGCG CCGAGCGTTG TTCACGGCGC GGCTGCTCCT
GAAGCGTATA TGACCGAGTA CGGCAATGCC CTGAACGTCT TCGCCAGCGA CCTCGACGTT
GATGCGGTCG TCGAGGCGTT GCAGGATGCT GCGAAAGACC TGAAGTAA
 
Protein sequence
MVMCHSPHDH LHNPKGREAM FTTKWLKLFG VFSILALVLA ACGGAPATTQ PTAAPAQPTT 
APAPTGGGKL EIFSWWTNGG EADGLNAMFD IYKQQNPGIE IVNATVAGGA GTNAKTVLKT
RLQGGQPPDS WQVHAGKELT AYVDAGQMEP LTQFFKEQGF DKVMPPKLLE QITYNGEIWS
VPVNIHRSNV LWYNIKIFQE NGLTPPKTID DFFTVAEALQ AKGIIPLAVG GKDKFETPHL
FESVLLAVFG PDDYAKLFQP GADWSDPRVR QAAEIAKRML EYSNSDRSSL GWADAAQLVL
DGKAAMTIMG DWAHGYFISK GAKVGVDYGY AAAPGNDGVF MWLSDSFGLA KGAPNPEQAK
AWLALCGSRE GQDAFNPKKG SIPARTDANV SLYDEYLQYS IKAFGSEKLA PSVVHGAAAP
EAYMTEYGNA LNVFASDLDV DAVVEALQDA AKDLK