Gene Rcas_1879 details


Gene Information

Locus tag: Rcas_1879
Symbol:
ID: 5539357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2412968
End bp: 2414326
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 62%
IMG OID: 640894016
Product: extracellular solute-binding protein
Protein accession: YP_001431987
Protein GI: 156741858
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
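
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop, not a residue


# Values copied from the record for Rcas_1879.
length = gene_length(2412968, 2414326)
print(length, protein_length(length))
```

With the values in this record, the result agrees with the listed gene length of 1359 bp and protein length of 452 aa.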


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.29051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCA AACTCTCACG ACGCAGGTTC CTCAAGGTTG CTGCCGCAGG CGCGGGCAGC 
ATTGGCGCAG CAGCGCTGCT GGCGGCGTGC GGCGGCGCGG CGCCGCAGGG CGGGCAACCG
ACAGGCGGGC AGGCGCAACC TGCCGCGCCG GTTCAGGGTG ACGCCGTTGT CACCGAGATC
ACCTTCTGGT GGTGGGATCA GGTCGGTGAG GTGTGGAAAG AACCGTTTGA GAAGGCGCAC
CCCAACATCA AACTCAACTT CGTCAACACC CCCTTCGCCG ACGCGCACGA CAAACTCCTG
ACCTCCTTCG CCGCCGGAAG CGGCGCTCCC GATGTCGCTT CAATTGAGAT CGGGCGTGTC
GGCAATTTCA CCGCCAAAGG CGGCCTCGCC GATCTGCTGG CGCCGCCGTT TGATGCCGGC
AGCCTGAAGA ACGATATGGT TGCCTACAAG TGGACACAAG GCTCCACTGC CGATGGCCGT
CTCGTCTGCC TGCCGTGGGA CATCGGACCG GCTGGCGTTT GGTACCGCAC CGACATTTTC
GAGGCGCTTG GGTTGCCAAC CGATCCAGAG GCGGTCGAGG AGTTGATCGG CGGTCCCAAC
CGCACGTGGG ACGACTTCTT CGCGTTCGCA AAGCAACTGA AAGAGAAGAG CGGCGGCAAG
ACCTCGCTCT TCGCCGATGC CGGCACCGAC ATCTATGGCG CCGTTTACCG CCAGCAGGGT
GAGGGATATG CCGATGGCAA CAAAGTGCTG ATCGAAGAAA AGGCGACCCG TCCATTCCAG
CTGGCTGCGC GCGCGCGCAA AGACGGGATC GATGCCAATA TTCCCTGGTG GGGCGCCGAG
TGGCAGACCG GCTTGAAAGA CAACGCCTTT GCCGGGATGG TCATCGCATG CTGGATGCAG
GGCGGGTTGA CGCGTGAGCA GCCCGATCTG GTTGGGAAGT GGCGCGTGAT ACGCGCTCCA
GAAGCCAACT ATAACTGGGG CGGCTCGTTC ATGGCGATCC CGGAGCAGAG CAAGAATAAA
GAAGCCGCCT GGACCTTCGT CAAATGGGCA TGCGCAACGG CAGAGGGGCA GAACATCATG
TTCAAGGCGT CGGGTGTGTT TCCGGCATAT AAGCCCGCCT GGCAGGACCC GCTGTACGAT
GAGCCGGTGC CGTTCTTCGG CGGTCAGCGC GCCTATCGTC TCTGGACGGA GATCGGCGAC
AACATCAAGG CGATTTTCCG CACGCCGCAC GATCTCCAGC TCGATGACAT CGTTGGCGCG
GAATTGACCA AAGTGCTGCA AGAAGGGAAA GACCCCGTCC AGGCGGCGAA AGACGCCGAG
GCGGAAGCGA TTCGGCGCAT TCCCGATATG CAGGCGTGA
 
Protein sequence
MTTKLSRRRF LKVAAAGAGS IGAAALLAAC GGAAPQGGQP TGGQAQPAAP VQGDAVVTEI 
TFWWWDQVGE VWKEPFEKAH PNIKLNFVNT PFADAHDKLL TSFAAGSGAP DVASIEIGRV
GNFTAKGGLA DLLAPPFDAG SLKNDMVAYK WTQGSTADGR LVCLPWDIGP AGVWYRTDIF
EALGLPTDPE AVEELIGGPN RTWDDFFAFA KQLKEKSGGK TSLFADAGTD IYGAVYRQQG
EGYADGNKVL IEEKATRPFQ LAARARKDGI DANIPWWGAE WQTGLKDNAF AGMVIACWMQ
GGLTREQPDL VGKWRVIRAP EANYNWGGSF MAIPEQSKNK EAAWTFVKWA CATAEGQNIM
FKASGVFPAY KPAWQDPLYD EPVPFFGGQR AYRLWTEIGD NIKAIFRTPH DLQLDDIVGA
ELTKVLQEGK DPVQAAKDAE AEAIRRIPDM QA
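
The protein sequence can be reproduced from the gene sequence by translating it codon by codon with translation table 11, as the following sketch illustrates. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, the code uses only the Python standard library, and it assumes the sequence is given 5' to 3' on the coding strand, starting at the ATG start codon. Table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: first base varies slowest, third base fastest, bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and the matching start of the protein sequence.
first_60_bp = "ATGACCACCA AACTCTCACG ACGCAGGTTC CTCAAGGTTG CTGCCGCAGG CGCGGGCAGC"
first_20_aa = clean("MTTKLSRRRF LKVAAAGAGS")
print(translate(first_60_bp) == first_20_aa)
print(round(gc_content(first_60_bp), 2))
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields of the record to be checked.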