Gene Rcas_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2119 
Symbol 
ID5539599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2722187 
End bp2723335 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content59% 
IMG OID640894253 
Productextracellular solute-binding protein 
Protein accessionYP_001432222 
Protein GI156742093 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTC GCGTTGCGCT CATTGGCGGA CCAATGTACA ACGCACTCTA TACTCGCCTG 
GATCAGTTCA GTCAGCAGAG CGGCGTTCAG GTGGAAGTCG CTTTTGTCGG CGATCATCCC
GCCCTTAACA CGTTCCTGGC GACAGATGCT GCGGCAGACT GTCATGTGGT ATCAACTCAC
ACCAAGTACG CCCCATCGCA GCAACGTCTC CTGGCGCCCC TCGACGAACT TTTGACGCCT
GCTGAGTGGA GCGACTTTAT GCCTTCACTC CTCGAATTGG CGCGCATTGA TGGTCGGCTC
TACGGCATTC CTCGCAACAT CGATGTGCGC CTGCTGCACT ATCGCACCGA TCTGATTGAC
CAACCGCCCA CCACGTGGGA CGAGCTGCTT GACCTGGCGC GCAGGGTCAA CCATCCGCCT
GAATGGTATG GCTTTCTCTT TCCCGGCACA GAGTCAGGAC TCTTTGGCAC GTTCTACGAA
CTGGTCGAGA GCGCGAATGC CAGGCTGTTT TCCCCTGATC TGACACCGAA TATCGAGAAT
GACGGCGGAC GCTGGGCGCT AGGGTTTTTG CGCACCTGTT ATGCGGAAGG ACTGGTTCCG
CCGGAGATTG TCACCTGGCA CTATGATGAG GTGCATCTCT GGTTCCGCGC TGGACGTGCC
GCGATGGTAG GAGATTGGCC CGGCTATTAT GCCGATTATT GCGCCACCGA CTCGCAAGTG
CGTGAACGCT TTGCGCTTGC ACTCTATCCT GCCGGACCGT CCGGGGGGGT GCGTGTGTAT
GGCGGCAGCC ATACTTTTGC TCTGACCCAC CGCGGGGTGG AGCAGACCGA TGCTGTCGCG
CTACTGCGCT TCCTCACCGC GCCCGAGCAG CAATTGCTGG AGGCGAAACA GGGTTCGACG
CCGGTGCGCC ATTCCGTTAT GCAGCGGATC GAGCAGCATG CCACACCACA CGAGCGCCAG
CGTTGGGCGA CCCTCGCCGC CGCTATTGAA CGGGTGGTCA TTCCCCCCAA ATTTGAGCGG
TATCCGCTGG TTGAGCAGGC GCTCTGGACA ACTGTCCAGC AGGCGATGGT CGGCGCCATA
GCAATTGACG AGGCCTTGCA TCGGTTGACA GACCGGATTA CCAGAATTGT GGCAGGCAAT
GATGGGTGA
 
Protein sequence
MTVRVALIGG PMYNALYTRL DQFSQQSGVQ VEVAFVGDHP ALNTFLATDA AADCHVVSTH 
TKYAPSQQRL LAPLDELLTP AEWSDFMPSL LELARIDGRL YGIPRNIDVR LLHYRTDLID
QPPTTWDELL DLARRVNHPP EWYGFLFPGT ESGLFGTFYE LVESANARLF SPDLTPNIEN
DGGRWALGFL RTCYAEGLVP PEIVTWHYDE VHLWFRAGRA AMVGDWPGYY ADYCATDSQV
RERFALALYP AGPSGGVRVY GGSHTFALTH RGVEQTDAVA LLRFLTAPEQ QLLEAKQGST
PVRHSVMQRI EQHATPHERQ RWATLAAAIE RVVIPPKFER YPLVEQALWT TVQQAMVGAI
AIDEALHRLT DRITRIVAGN DG