Gene Rcas_0383 details

Gene Information

Locus tag: Rcas_0383
Symbol:
ID: 5537845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 480347
End bp: 482203
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 61%
IMG OID: 640892546
Product: extracellular solute-binding protein
Protein accession: YP_001430533
Protein GI: 156740404
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
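
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 480347
end_bp = 482203
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match the stated gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions all three checks agree with the values in the record.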


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.531894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.778896
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTCGA TGCGTGACGT ATGGCGGCGA AGCGGCGCGC TGGCGCTGCT TCTTATCCTG 
ATCATCCCGG TCCTGGCGGC GTGTGGCGGT CAGCAACCGG CAGCGCAACC GACGGCAGCG
CCAGCGCAAC CGACGGCAGC GCCAGCGCAA CCGACGGCAG CGCCAGCGCA ACCGACGGCA
GCGCCAGCGC AACCGACGGC AGCGCCGACG GCGCCACCTG CCGCACAGCG GGGCGGGCGC
CTGAAAATCC TCTACTGGCA GGCGGTGACG ACCCTCAACC CGCACCTGGC GACCGGTACG
AAGGACTTTG ATGGCGCGAC CGTTATCCTC GAACCGCTGG CGCGTTACAA CGAAAAAGAT
GAACTGGTGC CCTTCCTGGC GGCTGAGATT CCCACCATCG AGAATGGCGG CGTCGCTCCG
GATGGAACCA GCGTGACCTG GAAACTCAAA CCAGGGCTCA AGTGGTCGGA TGGGAGCGAT
TTCACAGTCG ACGATATCAT CTTTACCTGG CAGTACTGTG CTGATCCGGC GACGGCCTGC
ACGACGAAGG CGGTCTTCGA TCCGATCGCC AATGTCGAGA AGGTCGATGA TACGACGGTC
AAGATCACCT GGAAAGAGCC TACTGCCTAC CCCTACATCG CCTTCGTCGG TCCGAATGGC
ATGATCCTCC AGAAGAAGCA GTTCGAGAAG TGCATCGGCG CGGCAGCCAG CACCGATGCG
GCATGCCAGG CGGCGAATCT GGCGCCGATC GGTACGAATG CATGGAAGCT GAAGGAGTTC
AAGCCGGGCG ATGTGGTGAT CTATGAGCGT AACCCCTTCT TCCGCGATGC CGACAACGTC
TTCTTCGACG AGGTCGAGAT CAAGGGCGGC GGTGATGCTG CCTCAGCCGC GCGCGCCGTC
TGTGAGACGG AAGAGGTGGA TTTTGCGTGG AACTTGCAGA TTCCGAAGGC GGTGCTCGAG
CCGATCCTTG CGAGCGGTAA GTGCGACCCG CTGGCCGGCG GTTCGTTCGG TGTCGAGCGT
ATTGTCGTCA ACTTCGCCAA CCCCGATCCG GCTCTCGGCG ATAAGCGCAG TGAACCGGAT
CAACCGCATC CGTTCCTGAC CGATCCTGCC GTGCGCAAGG CGATCTCGCT GGCGATTGAC
CGCAAGGCGA TTGCTGAGCA GTTGTACGGA CCGACCGGCA AGCCGACCTG CAATGTGCTG
GTCGTGCCTG CATCGGTTAA CTCACCGAAC CTGACGTGTG AGCGCGATGT CGAAGCGGCG
AAGAAGTTAC TCGAGGATGC GGGCTGGAAG TTGAACGGCT CGGTGCGCGA GAAGGAAATC
GGCGGGAAAC CGGTCCGGCT TGTCGTCAGT TTCCAGACCT CAATCAACCC GCTGCGCCAG
AGCACGCAGG CGATCATCAA GTCGAACCTG GCGGAGATCG GCATTCAGGT GAACGTCAAA
GCCATCGATG CCAGTGTCTT TTTCGGCGGT GATGAGGGCA ACCCGGATAC GCTGAACAAG
TTCTACGCCG ACCTCCAGAT GTATACGAAC GGTCCGAGCA GCGCCGATCC GCAGCAATAC
CTCCAGGGGT GGCTCTGCTC CGAGCGCGCG TCGGCGGCGA ACCGGTGGAA TGGCAACAAC
GACGGACGTT ATTGCAACCC GGAGTATGAC GCCCTCTTCG AGCAGTTGAA GAAGGAACTT
GATCCGAAGC AGCGCGCCGA ACTGGCGATC AAGATGAACG ATCTGCTGGT GACCGATGGC
GCCATCATTC CGCTTATCAA CCGCCAGACG CCGAATGCGA AGGTGAAGGC GCTCAAAGGT
CCGACCTTCA ATACGTTCGA CTCGAGCATC TGGAATATCG CCTCCTGGAG CAAGTAA
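
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be cleaned before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (blocks of ten bases).
first_line = "ATGCGTTCGA TGCGTGACGT ATGGCGGCGA AGCGGCGCGC TGGCGCTGCT TCTTATCCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1857 bp sequence through the same two functions gives a value that can be compared with the GC content field.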
 
Protein sequence
MRSMRDVWRR SGALALLLIL IIPVLAACGG QQPAAQPTAA PAQPTAAPAQ PTAAPAQPTA 
APAQPTAAPT APPAAQRGGR LKILYWQAVT TLNPHLATGT KDFDGATVIL EPLARYNEKD
ELVPFLAAEI PTIENGGVAP DGTSVTWKLK PGLKWSDGSD FTVDDIIFTW QYCADPATAC
TTKAVFDPIA NVEKVDDTTV KITWKEPTAY PYIAFVGPNG MILQKKQFEK CIGAAASTDA
ACQAANLAPI GTNAWKLKEF KPGDVVIYER NPFFRDADNV FFDEVEIKGG GDAASAARAV
CETEEVDFAW NLQIPKAVLE PILASGKCDP LAGGSFGVER IVVNFANPDP ALGDKRSEPD
QPHPFLTDPA VRKAISLAID RKAIAEQLYG PTGKPTCNVL VVPASVNSPN LTCERDVEAA
KKLLEDAGWK LNGSVREKEI GGKPVRLVVS FQTSINPLRQ STQAIIKSNL AEIGIQVNVK
AIDASVFFGG DEGNPDTLNK FYADLQMYTN GPSSADPQQY LQGWLCSERA SAANRWNGNN
DGRYCNPEYD ALFEQLKKEL DPKQRAELAI KMNDLLVTDG AIIPLINRQT PNAKVKALKG
PTFNTFDSSI WNIASWSK
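
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a sketch of that relationship, the code below translates the first and last lines of the gene sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted); the function name `translate` and the table construction are illustrative assumptions, not the database's own method.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied as displayed.
first_line = "ATGCGTTCGA TGCGTGACGT ATGGCGGCGA AGCGGCGCGC TGGCGCTGCT TCTTATCCTG"
last_line = "CCGACCTTCA ATACGTTCGA CTCGAGCATC TGGAATATCG CCTCCTGGAG CAAGTAA"

print(translate(first_line))  # MRSMRDVWRRSGALALLLIL
print(translate(last_line))   # PTFNTFDSSIWNIASWSK
```

Both outputs match the start and end of the protein sequence shown above. Each full line of the gene sequence holds 60 bases, a multiple of three, so every line begins in frame and can be translated on its own.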