Gene Rcas_3566 details


Gene Information

Locus tag: Rcas_3566
Symbol:
ID: 5541067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4655754
End bp: 4657118
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 59%
IMG OID: 640895685
Product: extracellular solute-binding protein
Protein accession: YP_001433633
Protein GI: 156743504
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
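
The gene and protein lengths in this record follow from the start and end coordinates. A minimal Python sketch of that arithmetic is given here; it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 4655754
end_bp = 4657118

# Assumption: 1-based inclusive coordinates, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")      # 1365 bp
print(protein_length_aa, "aa")   # 454 aa
```

Both values agree with the Gene Length and Protein Length fields listed above.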


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.42847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.297516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACAC GCACCCGTCT GAGTCGACGA CAGTTTCTGC GCAGTGCAGC CGTGGGAGGA 
GCAGCGCTTG CCTCCGGCAT TCTGGCGGCT TGCGGCTCGT CGCCAACAGC GCCCACAACC
GGTAACCCGA CAGCAGCAGC GCCGACGCAG GTCTCCAGCG AACCAACGAA GATTCGTGCG
CTTATGTGGA GTAATGGACC GGTCATCGAC GAGAACTTCC GAGTCCGCGC GCAGATGTTC
AACGAAGCGT TCAAGGGGCA GTACGATCTC GATCTGCAAC TTCTGCCCTA CGACCAGTAC
TGGCCCCGCA TCGACCTGGC ATATGGCTCG AAGAACCCAT ACGACCTCTA CTTCTTCGAC
GTGCAGGCAT ACGGACACTA CCGGGCAGGG TTGCTCGCCA ATATCCAGCC GTATGTCGAT
CTGGCGCCGG AACTGATGAA CGCCGAGGAG TATCCGGTGG CACTGTACGA TGCCTGGCGC
TTCGACGGCA GCAATCTCTA CGGCTTGCCG GAAAATATCC AGGTGCTGGC GCTCTACTAC
AACCGTGATC TTTTCGATGC CGAGGGGCTG GCATACCCCG ACGAGACCTG GACGTGGGAC
GATGTGATCA ATGCCGCCAC GAAACTGACG AAGCGCAGCG GCGATGAGAC CACGCAGTGG
GGGATGGATG TCGGCGTGAT GGATATCTGG TGGGGCGCGC AGACGCTGGC GTGGGCGATG
GGTGGCGGTT TTTTCGATAA GATCGTCGAG CCGACGAAGT TTCAGGTCAG CGATCCGGTC
AATGTGCAGG CGCTCACCTT TCTCCGCGAC CTGATCTTCG AGTATAAAGT CGCTCCCACC
AAAACCCAGC GTTCCGCGAC AGCACAGGAT ATTGGCATTT TCCAGACCGG CAAGGTGGCG
ATGTTCTTCG ATGGCAGCTG GGCGATCAGT GGTTTCCAGG ATGTGCCGTT CAAGTGGGAT
ATGGCGCCGT TGCCCATGTG GAAGGATAAG CGCGTCTCCG CTTACTGGCT TGGCGGGCAG
GTCATTCCGA AAGACTCGAA GGTCATCGAC GCCGCCTTCG CCTTTTCGCG CTGGTCGGCA
ACAACGTATC AGAAGACGAT GGCGTCCAAC CACGACTGGA TACCAATCGC GCGTTCGGCG
CGCGAGTCCG AGGAGATGTA TGTCGGGCAA CCAGCCGGGC TGCGCAAAGT GCTCGGCACT
ATCGAAGGTG CACGACTTGG TGATTTTTAC TCACGCAACA ATCAGCAGAT CTTCGGCGAG
GTGCTGCTGC CGACGTTCGA TCAGTTGTTC CTCGGCAACC TGACGCCGGA AGAGGCGGCA
AAGAAGATCG ATGAAGAAGC CAATGCGCTT CTTGCGAAAG GATGA
 
Protein sequence
MGTRTRLSRR QFLRSAAVGG AALASGILAA CGSSPTAPTT GNPTAAAPTQ VSSEPTKIRA 
LMWSNGPVID ENFRVRAQMF NEAFKGQYDL DLQLLPYDQY WPRIDLAYGS KNPYDLYFFD
VQAYGHYRAG LLANIQPYVD LAPELMNAEE YPVALYDAWR FDGSNLYGLP ENIQVLALYY
NRDLFDAEGL AYPDETWTWD DVINAATKLT KRSGDETTQW GMDVGVMDIW WGAQTLAWAM
GGGFFDKIVE PTKFQVSDPV NVQALTFLRD LIFEYKVAPT KTQRSATAQD IGIFQTGKVA
MFFDGSWAIS GFQDVPFKWD MAPLPMWKDK RVSAYWLGGQ VIPKDSKVID AAFAFSRWSA
TTYQKTMASN HDWIPIARSA RESEEMYVGQ PAGLRKVLGT IEGARLGDFY SRNNQQIFGE
VLLPTFDQLF LGNLTPEEAA KKIDEEANAL LAKG
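
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The Python sketch below shows one way to do that, together with a GC-fraction calculation and a codon-by-codon translation. The function names are hypothetical helpers, not part of any database tool. The codon table is written for translation table 11, which assigns the same amino acids to codons as the standard code (the two differ in which start codons they allow).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = clean_sequence("ATGGGAACAC GCACCCGTCT GAGTCGACGA")
print(translate(fragment))  # MGTRTRLSRR
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence through `clean_sequence` and then `gc_fraction` is one way to check the reported GC content.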