Gene Rcas_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0906 
Symbol 
ID5538372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1185887 
End bp1187866 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content60% 
IMG OID640893056 
Productextracellular solute-binding protein 
Protein accessionYP_001431039 
Protein GI156740910 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGGA CACGACTGCT TCTGACGGGG CTGCTGCTCG TGGTCAGCCT GATCATCGCC 
GCCTGTGGGG GAGGACCGGC GGCGCCAGCA GCAACACCAG CCGGTTCAGC GACGACGCCA
GCCGCGCCGG CGCCGGCGGG CGGAACAGAC GCGACACTGA CGGAGTTTGG CGATCTGCCG
CGCCACGAGA CGTTGATCGT CGATATTCTC ACCGGGCGGG TCGGCTCGCC GGACGACTTC
AACAACTGGG TGGGCTGGAA GTGGCGTGAT CGCGGGATGC AGAACCTGGC GAACGAGCCG
CTCTGGTCGG TTGATTTCGC CACCGGCAAG ATTATTCCGG GTCTGGCAGA GGGCGATCCG
GTTTACAACG CAGATTTTAC CGCCCTCACG ATTCCGCTGC GCAAGGGGGT GACGTGGCAC
GACGGACAGC CGTTCACAGC GGCAGACGTG GTCTTCACCG TCGAAACGCT GATGAAACAC
GAGGGGTTCG GCGACAACAG TTTTTTCGTG GAGAATGTGA AATCGGTCTC CGCCGTTGAT
GATCATACCG TCGCCTTCGA GTTGAACGCG CCGAACTCGC GCTTCCACAC CCGCTTCCTG
GATCGCTGGG GCTGCACCTG GATTATGCCC AAGCATATCT GGGAGTCGGT GGAAGACCCG
GTCACGTTTA AGTTTAACCC CTTCATCGGC ACCGGTCCGT ACAAACTGCA CAGTTTCGAC
CCATCCGGGT TCTGGACGAT TTGGGAGAAA CGGGCTGACT GGGACAAGAG TCCGACCGGC
ATGATGTACG GTGAACCAAA GCCCAAATAT GTCATCTTCC GGTACTTCGC CAACGAAGGC
GCTAAGATTC TGGCGCAGTT AACCCATCAG GTCGATGTTG TCAACGTGTC GTCCGACGGG
CTTAAGGCCG TGCTGACCCA GTGCGACTCC TGCCGCGCCT ATCAGTTGAA CTGGCCCTAC
GTCGTCAACA ATGATCCGGC GCAGACCGGC ATCACGTTCA ACACCGCCAG GGCGCCGTAT
GACAACCGCG ATGTGCGATG GGCGCTGCTG CTGGCAATTG ATATTGCCGA ATACATGGGC
ATTGCCGTCG ATGGCACCGG CGCGCTCAGC CCGGTTCACA TTCCGTCGCT GTCGAACTAT
CCCAAAGACT TCATCCAACC GATGCTGCCC TGGCTGGAAG AGTTCACCCT CGATCTCGGC
AATGGCGAAA CCTTCAAGCC CTTCGACCGA AACGCCTCGC AGCGCATCGC CGAGTATGCT
CGCTCGCGCG GCTACGCTGT GCCCGACGAT CCCGCCGAGC AGGCAAAACT GTTTGGCTAT
GGCTGGTACA AGTATGCGCC CGATGTCGCC GAAAAGCTGC TGGTCAAGAA CGGCTTCACC
AAGACGTCCG ACGGCAAATG GCTCTTGCCG GACGGCACGC CCTGGAAGAT TCGCTGCCTG
ACCGGCACGC AACTGGCGAC CGGCATGGGT GAACGCAACT GCGTCGCCGC TGTGCAGCAG
TGGAAGAGAT TCGGCATCGA CGCCGAGGTG TATTCCTCGG AAGCGGCAGC AAGCCTGAAT
GCAACCGGCG ATTTCGACGT TTCCAGCAAC TGGCCCGCGC AGGAACCCTG GGGCGCCGGA
CCAGACCTCT ACCGTGTGCT CGACTACTAC AACTCGGCGT ATGTGAAACC GGTCGGCGAG
AATACCAGCG GTCACCCGTC GCGCTGGTCG AGTCCGGAGA TGGATGCGAC GATCGAGAAA
TTGCGCCAGA CCGATCCCAC CAATTATCAG GCGGTCGTTG ATGTCGGCAT CGAAGGCTTG
AAGATTGCCG TGCGTGAAAT GCCCGGCATT CCGACCTATG GCTATGTCGG GTTCATTGCA
TGGGATCAGA CCTACTGGAC CAACTGGCCC GGCGCTGAGA ATCCCTACAC GCAACCGTAT
ACGCACTGGG GTCCGTTCAA ATATATGACG CCGTTCCTTC AGCCAACCGG GACGCGGTAA
 
Protein sequence
MRRTRLLLTG LLLVVSLIIA ACGGGPAAPA ATPAGSATTP AAPAPAGGTD ATLTEFGDLP 
RHETLIVDIL TGRVGSPDDF NNWVGWKWRD RGMQNLANEP LWSVDFATGK IIPGLAEGDP
VYNADFTALT IPLRKGVTWH DGQPFTAADV VFTVETLMKH EGFGDNSFFV ENVKSVSAVD
DHTVAFELNA PNSRFHTRFL DRWGCTWIMP KHIWESVEDP VTFKFNPFIG TGPYKLHSFD
PSGFWTIWEK RADWDKSPTG MMYGEPKPKY VIFRYFANEG AKILAQLTHQ VDVVNVSSDG
LKAVLTQCDS CRAYQLNWPY VVNNDPAQTG ITFNTARAPY DNRDVRWALL LAIDIAEYMG
IAVDGTGALS PVHIPSLSNY PKDFIQPMLP WLEEFTLDLG NGETFKPFDR NASQRIAEYA
RSRGYAVPDD PAEQAKLFGY GWYKYAPDVA EKLLVKNGFT KTSDGKWLLP DGTPWKIRCL
TGTQLATGMG ERNCVAAVQQ WKRFGIDAEV YSSEAAASLN ATGDFDVSSN WPAQEPWGAG
PDLYRVLDYY NSAYVKPVGE NTSGHPSRWS SPEMDATIEK LRQTDPTNYQ AVVDVGIEGL
KIAVREMPGI PTYGYVGFIA WDQTYWTNWP GAENPYTQPY THWGPFKYMT PFLQPTGTR