Gene Rcas_1695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1695 
Symbol 
ID5539173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2187594 
End bp2188829 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content60% 
IMG OID640893834 
Productextracellular solute-binding protein 
Protein accessionYP_001431805 
Protein GI156741676 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0207768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000479364 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACACGT CACCCAATGC TCGAACAATC GCACTTCTGT TGCTGACCCT CATTCTCGCC 
GGGTGCGGCG ATCTCGGCGG ATTGCTCGGC AACCAGCCCA CTCCTGCACC GATCATCCTC
ATCGCAACCG CCACACCCGT ACCGCCGAGT CAGGCGACCC CGACGACAGA CATTGTTCCA
TCGCCGACAG TTGCGCCACC CGACACTCCG GTGGCAACCT CCGTCCTACC GACTGCTGCG
CCTCCCACAC CCACACCCGC ACCGCAAAAA ATCCTGGCGC GCGTCAAAGA GCGTGGCTAT
CTGATCTGTG GAACGAACGC CGATCTGCCG GGGTTCGGCT TCTACGACAA CGTGCGCCAG
GCGTGGAGCG GCTTCGATGT CGATTTCTGC CGCGCCGTCG CTGCTGCCAT CTTCGGTGAT
GCCACAAAAG TGGAGTTCGT CGCGCTCGGC ACCGGACCAG GACCCAACAA CCGGTTCGAT
GCTGTGCGTG AAGGGCGCGT CGATGTCCTG TTCCGTAACA CGACATGGAC ACTTGGACGC
AACATCAGCG GGTTGGCATT CGGTCCCACC ACCTTCCACG ATGGTCAGAC CTTCATGGTG
CGCATCAGGG ACCGGATCAC CAAACTGGAA GACCTCGCAG GCAAAGTCAT CTGTGTGGCG
AAAGGCACCA CCAGCGAGCA AAACCTGAAC GACGACTTTG CCGCGCGCGG CATTCAGTTC
ACTGCCCGCG TCCTCAATGG CGAAGACGAA CTCTACCCGG CGTATGATGA AGGGGAGTGC
GACGCAGTGA CCAGCGACAG TTCACAACTG GCAGCCAAAC GTCAGCAACT CAAGAATCCC
GCCGACCACA TCATCCTCGG CGACCGCATC TCACGCGAGC CGCTCGGTCC GGTCATCGCC
CGCGACGACA ATCAGTGGCT CGACGTGATC AGCTGGACGG TCTTTGCCAC GATTTACGCC
GAAGAACTGC GTGTTGATCA GCGCAATGTC GATCGTCTGC GCGCCAGCAC GACCGATCCG
CGTATCAAAC GGCTGCTAGG GCTGGAAGGA AACTTCGGCG AGGGATTGGG GTTGCCGAAC
GACTTCGCCT ATCAGATTAT CAAGCAGGTC GGCAACTACG GCGACATCTA CAACCGTAAC
CTGGGACCGA ACACCGTCAT CAATCTGGAT CGCGGTCCGA ACAAAGTCTG GAACCTTGGC
GCTGGCGGCG TGCTTGCCTC CCCGCCGTTT CGTTGA
 
Protein sequence
MHTSPNARTI ALLLLTLILA GCGDLGGLLG NQPTPAPIIL IATATPVPPS QATPTTDIVP 
SPTVAPPDTP VATSVLPTAA PPTPTPAPQK ILARVKERGY LICGTNADLP GFGFYDNVRQ
AWSGFDVDFC RAVAAAIFGD ATKVEFVALG TGPGPNNRFD AVREGRVDVL FRNTTWTLGR
NISGLAFGPT TFHDGQTFMV RIRDRITKLE DLAGKVICVA KGTTSEQNLN DDFAARGIQF
TARVLNGEDE LYPAYDEGEC DAVTSDSSQL AAKRQQLKNP ADHIILGDRI SREPLGPVIA
RDDNQWLDVI SWTVFATIYA EELRVDQRNV DRLRASTTDP RIKRLLGLEG NFGEGLGLPN
DFAYQIIKQV GNYGDIYNRN LGPNTVINLD RGPNKVWNLG AGGVLASPPF R