Gene Rcas_2800 details

Gene Information

Locus tag: Rcas_2800
Symbol:
ID: 5540287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3620835
End bp: 3622727
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 60%
IMG OID: 640894927
Product: extracellular solute-binding protein
Protein accession: YP_001432889
Protein GI: 156742760
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
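
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3620835, 3622727)
print(length)                     # 1893
print(protein_length_aa(length))  # 630
```

Both values agree with the Gene Length and Protein Length fields in this record.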


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACA ACAAACGACG ACTGACCCGG CGAACATTCC TGCGCGCTGC GGCTATCGGC 
ATTGGTTCAG CGACACTTGC CGCCTGTGGC GGCGGCGGCG GCGCCACTGC TCCTACAGCG
CCGCCCGCAA CGGTTCCTCC GCCGACGACC GCTCCAACCC TCGCGCCGGT GCAACCCACG
CCGCCTCCGA CGACAGCCCC CGCGCCCACG ACTGCACCAG CCAGCGCCGT GACCAACTCG
CTCGGCGTCA CCCTGCCGGC AAACGCAGCG CCGCTTGAGC ATCAGGCATT TGTCGTCTAC
TTCGACATCA CTGCCGACTT CACCAGCCCC AATCAGATGG AGACGATCTA CAAGTCGGGA
GGGTTTGGCA GCATCACCAA CCTGACCGGC GATACGCTCG TGCGCCTGAA TAAAGATTTT
CAGGTACAAC CGGCTGCTGC GCTTTCGTGG TCGTCGGATG AAACCGGCAA GGTCTGGACG
TTCAACCTGG ACCCCAACCT GGTCTGGAGC GATGGCACAC CGGTGACAGC CGAAGACTTC
GTGGCGACCT TCCGCTACGC TGCCGATCCG AAGCATGCCT GGGATTTCGC CTGGTACTAC
AGCGCACCCG GCGCAATCAA GAACTGGGAC AAGTGCGTTG CTGGCGAACT GCCGCTGGAA
GAGCTTGGCG TGACCGCCAA AGATGCCCAT ACGCTGGTTA TCGAAACCGA AACGCCAGCC
CCTTTCTTGC CCGCAAAACT GGTCTACAGC GAGGTGTTGA GCGCCGCAAA ACTGAAGGAA
TATGGTTCTG GTCTCTATAC CGCCGATCCG GCAAAGACGA TTTCGTGCGG ACCGTACCTG
CTGAAAGAGT TCAAGCCGGG CGAGCGGGTG GTCTTCGAGA TCAACCCGAC GTACAAAGGC
ACCAACCGCC CGCGCATTGA GCGTGTCATC CAGATCGCTG CGCGACCGGA AGCCATGTTC
GCCGGGTATC AGGCAGGCGA GGTGGACCGC GTGACCGGAG AGCAGTTGCA GACTGCCGAT
AACGAAATCA TTGCCCGCGA CCCCGAACTG TCGAAACAGG TGCGCCTCAC TGCTGCCGAT
TTCCGCACGG ACTACCTCTT CTTCGACTGC CAGAACCCGC CGTTCAACGA CGTGCGGGTG
CGCCAGGCGT TCAGCCACAT CATCGATCGT GACACGCTGA TCAAGACGAT CATCACGCCG
ACGCAAGGCA TCCCGGCATA CTCGTTCCTG ATGCCGGGAT TCCCGGCGTC GAACTCCGAA
GGGCTGAAGG ATATTCAGCG CTACGATCCC GAACGTGGCC GCGCGCTGCT GAAGGAAGCC
GGCTATGAGG GAGGCAAGGG CTTCCCCAAA CTGACCCTCT GGCTGCGCAA CGAACCGCAG
ATTCGCCAGG CGCTCGCAGC GGCGATTGCG GCGGCGATCA CGCAGGAGTA CGGCATCGAA
GTCGAGGTCT CGAACAAAGA GTTCAAGACC TTTATGGACG CGCTCAACGC CAAGCCGACC
CAGATTCAGT TCGGTATGGT GTCGTATGGC ATCGACTTCC TCGATCCGTC GAATATGCTC
GGCGTCTGGC TCAGCACGGG GCGCCACAAC TGGTTCAACA AGAAGTTCGA CGAGATGGTG
CTGAAAGCGG CAGAGATGAC CGATCAGGAA GCGCGCATCA AAATCTTCCA GGATGCCGAA
CGGTTGCTCT GCGAAGAAGC GCCGGCGGTC TTTATCTATC ACCGCACCGT TGCCGATATC
TACAAGCCGT ATGTCGTCGG CGAGTGCTTC GAGCCGAACA TCGCCGGATT CGCCGGATTG
CAGTGGCCCG GCTTTACATC GATGAGCGAC TCACTCCAGA CGCTGTACAT CAGCGATGAA
GTGACAAAAT ATCGCAAGGC GCCGCCGAAG TAG
 
Protein sequence
MTDNKRRLTR RTFLRAAAIG IGSATLAACG GGGGATAPTA PPATVPPPTT APTLAPVQPT 
PPPTTAPAPT TAPASAVTNS LGVTLPANAA PLEHQAFVVY FDITADFTSP NQMETIYKSG
GFGSITNLTG DTLVRLNKDF QVQPAAALSW SSDETGKVWT FNLDPNLVWS DGTPVTAEDF
VATFRYAADP KHAWDFAWYY SAPGAIKNWD KCVAGELPLE ELGVTAKDAH TLVIETETPA
PFLPAKLVYS EVLSAAKLKE YGSGLYTADP AKTISCGPYL LKEFKPGERV VFEINPTYKG
TNRPRIERVI QIAARPEAMF AGYQAGEVDR VTGEQLQTAD NEIIARDPEL SKQVRLTAAD
FRTDYLFFDC QNPPFNDVRV RQAFSHIIDR DTLIKTIITP TQGIPAYSFL MPGFPASNSE
GLKDIQRYDP ERGRALLKEA GYEGGKGFPK LTLWLRNEPQ IRQALAAAIA AAITQEYGIE
VEVSNKEFKT FMDALNAKPT QIQFGMVSYG IDFLDPSNML GVWLSTGRHN WFNKKFDEMV
LKAAEMTDQE ARIKIFQDAE RLLCEEAPAV FIYHRTVADI YKPYVVGECF EPNIAGFAGL
QWPGFTSMSD SLQTLYISDE VTKYRKAPPK
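
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and it is an assumption that the GC content field is calculated over the full gene sequence in this way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as a short example.
first_line = "ATGACAGACA ACAAACGACG ACTGACCCGG CGAACATTCC TGCGCGCTGC GGCTATCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the complete gene sequence block through `clean_sequence` should give a string whose length matches the Gene Length field.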