Gene Rcas_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0516 
Symbol 
ID5537979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp671225 
End bp672691 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID640892678 
Productextracellular solute-binding protein 
Protein accessionYP_001430664 
Protein GI156740535 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAA GACAGTACCC CTGGATTCTC CTGCTCGGTG CGTTGATGCT GGTGCTGGCA 
GCATGTGGCG GGCAACAAGG CGCCTCCCCC ACGGCAGCGC CGGGCGAAGC GCCAACCACT
GCGCCGGCCA GTGACCTGCC GACTCCTACG CCGGTTATTG AGTTTGCGCA GCAACCACAG
TCCGGTCAGA AGGTGCTGGT CTGGATGGTG CGCATCAACA CGGTCGAAAA CCGCTGGGAG
CGTGATGTTG TTCTGCCGGC ATACCAGCAG GTCGCCCCCG ATGTATTCGT GAAGGTGCTC
AATATCAACC AGGACGATAT CGCCGTCAAG CGTGAGGCGA TGATCGCCGC CAAGGAACCG
CTGCACGTCT GGTCGTCGAA CTGGGGCGGC GATGGCTTCG CCAGCGACCG CTTCCGCGGC
TTGCTGGCAG ACCTGACGCC GCTCATCGAA CGCGACAAGT GGGATACGAG CGACTTTATC
CCCGAAGTGT TCGGCATCTA CAACGTCGAG GGCAAGCAGT ACGGCATTCC GTTCCTCACG
ACCGGCAGTT ATGTGTACTA CAACATGAAA CTGTTCGACG AGGCTGGCGT GCCCTATCCG
CCGAGCGACT GGAACGATAA GTCGTGGACG TGGGATGCAT TCCTTGAGAC TGCCAAGAAA
TTGACGAAGA ACCCGGATGA TCCGAGCACG GCGGTGTACG GCGGTGTCAA CGGTTTGTGG
CCCCCCTTCG ACAGTATTCC GATGATCTGG GGAAAGGACC CGTTCACAGC ACAAGCGCTC
GAGACCGGCT TCTCCGATCC GATCAAACTG GATGAACAGA CGGCTGCGGC GTTCCAGGCG
GTTCACGATC TGGTCTACGT CCATAAGGTC GCGCCCGACC AGGCGGCGTC TCAGGCGCTC
GATCAACTGG GTGGCGCCTT CCTCTCCGGT CGAGTCGCCA TGTTTATGAC CGGCGGCTGG
GGTCACTGGA ACTATAAGGA TATCATCGAC GATCCCAACG GTTTCTGCTG GGGTGCAGCG
CCACTCCCCT GGGGGACGCC CGATGCCACT ATCCGCGCGA CGATCTTTAC CGATCCCTGG
GTCGTTACTG CTGGCATGGA TGCAGAGAAC ACCGATATGG CATGGAACTT CGTGAAGTTC
CTGGCATCGG CAGAGCAGCA GCGCGCTTAT ACCCTGGCTA CCGGAACCCC GCCGGTGCGC
CAGAGCCTGC TCAATGATTA CTACAAGCAG TATGAGAAGT GCGTCCCGGC AGAAAAGACG
AAAGAGTCGT TCCAGGGCGC CTTTAGCCAC GGACGCGAGT CATCGAACCA CCTGCTGGTC
AAGTTCGATG AACTCAGCCA GACCTGGGAT AACCTCCTGA GTCCGTTCTG GAACGATCCG
AACGCGAAAG CCTCGGATCT GATGCCGCTC CTCGAGTCGG ACGTCAATGC CGCTCTGGAG
CGCATCCGCA AAGAGGCGGG CAGGTAA
 
Protein sequence
MKIRQYPWIL LLGALMLVLA ACGGQQGASP TAAPGEAPTT APASDLPTPT PVIEFAQQPQ 
SGQKVLVWMV RINTVENRWE RDVVLPAYQQ VAPDVFVKVL NINQDDIAVK REAMIAAKEP
LHVWSSNWGG DGFASDRFRG LLADLTPLIE RDKWDTSDFI PEVFGIYNVE GKQYGIPFLT
TGSYVYYNMK LFDEAGVPYP PSDWNDKSWT WDAFLETAKK LTKNPDDPST AVYGGVNGLW
PPFDSIPMIW GKDPFTAQAL ETGFSDPIKL DEQTAAAFQA VHDLVYVHKV APDQAASQAL
DQLGGAFLSG RVAMFMTGGW GHWNYKDIID DPNGFCWGAA PLPWGTPDAT IRATIFTDPW
VVTAGMDAEN TDMAWNFVKF LASAEQQRAY TLATGTPPVR QSLLNDYYKQ YEKCVPAEKT
KESFQGAFSH GRESSNHLLV KFDELSQTWD NLLSPFWNDP NAKASDLMPL LESDVNAALE
RIRKEAGR