Gene Rcas_0113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0113 
Symbol 
ID5537573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp135103 
End bp136470 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content62% 
IMG OID640892278 
Productextracellular solute-binding protein 
Protein accessionYP_001430267 
Protein GI156740138 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.270734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA TCAACCGACG ACAATTCCTG CGTGGTGTTG CAGCAGGCGC CGGTGCGCTG 
ACACTGGCGG CTTGCGGCGG CGCAGCGACT ACATCGCCGA CCGGTGAGCC TGCTGCCCCA
CCGGCGACTG CCGCGCCGCC AACCGCGATG CCTCAGGGAT CAATGGGATC AACCGTCGAA
ATCACATACT GGGGATCGTT CAGCGGCGTT CTGGGTGAAG CCGAACAGGC GACCGTCGAG
ACATTCAACA GCATGCAACA GGATGTCAGG GTCAACTACC AGTTCCAGGG CAATTACGAA
GAGACTGCGC AGAAACTGAC CGCTGCGGTT CAGGCGCGGC AGACCCCCGA CGTCAGCCTG
CTCTCGGATG TCTGGTGGTT CTCGTTCTAC ATCAACGGTC AGTTGCAGCC GCTCGACGAC
CTGATGGCAG CCGAAGGGGT CAAGCGCGAA GCGTATGTTG ATGTGCTGCT CAACGAAGGT
ATTCGCAAAA ATACGGTGTA CTGGATTCCG TTCGCGCGCT CGACGCCGCT GTTCTACTAC
AACAAGGACG CCTGGGCGGA AGCCGGTCTC GACGATCGCG CGCCGAAGAC GTGGGAGGAG
TTTATGGAGT GGGCGCCGAA ACTCAACAGA GAAGGGCGTT CGGCTTTCGC CCACCCTGGC
GCCGCCAGCT ACATCGCCTG GCTCTTCCAG GGGGTGATCT GGCAATACGG CGGTCGCTAC
AGCGACCCCG ACTTTACCAT CCGCATCCAC GAAGAAAATG GCATCAGAGC GGGCAACTTC
TACCGCGATA CGACCCAAAC CTACAAGTGG GCGACGACGC CGAAGGATGT AACCCAGGAC
TTTGTCACCG GGCTGTCAGC CAGCGCGATG CTGAGCACCG GCGCGCTGGC AGGCGTTGAG
AAGAACGCGC AGTTCCCGGT TGGCACCGGT TTTCTGCCGG AGGGTCCGTC TGGCTTCGGA
TGCTGCACCG GTGGCGCAGG CATGGCAGTT CTGTCCGGAT TGCCCGCCGA GAAGCAGCAG
GCTGCGATGA AATGGATCGC CTTTGCCACC GGTGATGAGT GGGCAGTAGA CTGGTCTCAG
CGCACAGGGT ACATGCCGGT GCGCAAGGCG GCAGTCGAGT CGGAGCGCAT GAAGCAATAC
CTCGCCGAAC GACCGAACTT CCGCACCGCC GTCGAGCAAC TGCCGAAGAC ACGTCCGCAA
GACTCGGCGC GCGTCTACGT TCGCGGCGGC GACCAGATCA TTGGCAAGGG GCTGGAGCGC
ATCACGATTG CCGGCGAAGA CCCGGCGAAG GTCTGGATGG ATGTCAAGGC GGAACTTGAA
GAGACCGCGA AGCCAACCGT CGAACTGCTG AAGACGGTTG AGGGTTAG
 
Protein sequence
MAKINRRQFL RGVAAGAGAL TLAACGGAAT TSPTGEPAAP PATAAPPTAM PQGSMGSTVE 
ITYWGSFSGV LGEAEQATVE TFNSMQQDVR VNYQFQGNYE ETAQKLTAAV QARQTPDVSL
LSDVWWFSFY INGQLQPLDD LMAAEGVKRE AYVDVLLNEG IRKNTVYWIP FARSTPLFYY
NKDAWAEAGL DDRAPKTWEE FMEWAPKLNR EGRSAFAHPG AASYIAWLFQ GVIWQYGGRY
SDPDFTIRIH EENGIRAGNF YRDTTQTYKW ATTPKDVTQD FVTGLSASAM LSTGALAGVE
KNAQFPVGTG FLPEGPSGFG CCTGGAGMAV LSGLPAEKQQ AAMKWIAFAT GDEWAVDWSQ
RTGYMPVRKA AVESERMKQY LAERPNFRTA VEQLPKTRPQ DSARVYVRGG DQIIGKGLER
ITIAGEDPAK VWMDVKAELE ETAKPTVELL KTVEG