Gene Rcas_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0197 
Symbol 
ID5537658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp236232 
End bp238049 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content63% 
IMG OID640892360 
Productextracellular solute-binding protein 
Protein accessionYP_001430348 
Protein GI156740219 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.386701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGC AACGACTCAG AAGTTTCGTG CAGCGTGTTC TGATCCTGAC GGTCGCCTGC 
ATCGCACTGG CGGGATGCAC GATTGAGGGA GGCGTCCCGA CGCCGACGCC TGAACCAACC
CTCCCGGCGA GCGCTCCCAT TCCCACCGGT GCAGTGGTTG CCGAACGCAT CGCCGAGCGC
AACGACACAT GGATGATCGG GGCGCTTGAT CTTCCAGCGG ATCTGTACCC ATACCCGCAG
TCCGCCGCCA CCCGCCGCGC ATCGGCGACG ATCACCGAGT TGCTCTTCCC GTCGCCAATC
CTACCCTATA ACTACGGTTA TACCGCAACA GGAGTGCTCG AACGCATCCC CACACTCGAA
AATGGCGATG CGGAAATGCG CAAGGTCGAT GTCTATCTCG ATGCCACCGG CGCGATAACC
ACTACGGTCA CCGATGTCGT CACCCAGGTC GATCAACTGG TGATCACCTT CCGCTGGAAC
CCGCGGCTGC GCTGGTCCGA CGGCACGCCG GTTACAGCGG ACGACTCGGT GTTCGCCTAT
GAATTGGCGA AAGCCGCGCC GCCAGGCGAC GCGGCAGCCG AATTGCTGGC AAAAACCGCT
ACATACGAGA AGATCGATGA TCATACCACG CGCGCGGTGC TCCGCCCCGA TTATGTGGGG
GCGGCATATT TTGTGAGTTA CTGGACGCCG CTGCCACGCC ATCTGCTCCA GGGCGTCGAT
CCGGCGCGGG TGCGCGAGAG CGCATTTGCT CGTCAACCGG TCGGGTATGG TCCCTATATG
CTGGTCGAAC GCACTGCCAC TGAACTGCGC TTCGAGCGCA ACCCGCATTA TTTCGGTCCG
ACGCCAGCGG CGTCGCGGCT GGTCGTGCGC GTGTTTCCCG ATCTCGACCT GCTGCGCGCC
AATCTGCTCA ACGGCAATCT CGATCTGGGC ATTGCTGATC GGATCTCGAC CGCCCCGCTG
ATCCGCTTCG ACACCGACGC CGCCGAAGGC GCCGTACAGG TCTTCACCGT CTCCAGTCCA
GTTTGGGAAC ATATTCTGTT CAATCTGGAT GTTCCGGCAC TCCAGGATAT TCGGGTGCGG
CGCGCGCTGG CGTATGGCAC AAACCGGCAG GCGATGGTCG ATGCGCTCTT TGGTGGACGA
ACGCCGGTTC TGGATGGTTG GGTGGTGCCG GAGCACCCCC TTGCCGCGCC GCCCGATCAG
GTGACCCGCT ACCCGTATAA TCCCGATCAG GCACGGCAGT TGCTCGATGA AGCCGGATAT
ACCGACCCCG ATGGCGATGG CATCCGTGCG TCGCCCGATG GCGCCACGCT AACGCTGCAA
CTGCTGACGA CGCAAGGGAG CGACGTGCGG CGCGCAATTG CCCGCCGTTT TCAAGCAGAT
ATGCGCGCAA TCGGCGTCGC AATCGATATT AACGAAGCGT CGCCCGACGA GGTGTTCGAC
TCAGATGGAC CCCTCTACCT CCGACAGTTC GATCTTGCGC TCTTCGGGTG GATCGCTGGA
CCAGAGCCGG GCGGGTTGCA ACTCTGGAGT TGCGCCGCCG TTCCCGCCGA GAGCAACAAC
TATCGTGGCG AGAACTTTGC CGGCTGGTGT TTCCGCGACG CGGATCGCGC GGTACGCACC
GCCGACACGA CCCTCGACCC CGCCGAACGG GCTGAGGCGT ACCTGCGTCA GCAGCAACTG
TGGACGCAGG AACTCCCGGC GATTCCTCTG TTTCAACGCC TGAGCATCGT GGTGGCAGCA
CCCGATGTGC GTGGGCTTGC CCCCGATCCC CTCGCGCCGG TGACATGGAA TGTGGCGGCG
TGGAAAAGGG AAAAGTAA
 
Protein sequence
MTQQRLRSFV QRVLILTVAC IALAGCTIEG GVPTPTPEPT LPASAPIPTG AVVAERIAER 
NDTWMIGALD LPADLYPYPQ SAATRRASAT ITELLFPSPI LPYNYGYTAT GVLERIPTLE
NGDAEMRKVD VYLDATGAIT TTVTDVVTQV DQLVITFRWN PRLRWSDGTP VTADDSVFAY
ELAKAAPPGD AAAELLAKTA TYEKIDDHTT RAVLRPDYVG AAYFVSYWTP LPRHLLQGVD
PARVRESAFA RQPVGYGPYM LVERTATELR FERNPHYFGP TPAASRLVVR VFPDLDLLRA
NLLNGNLDLG IADRISTAPL IRFDTDAAEG AVQVFTVSSP VWEHILFNLD VPALQDIRVR
RALAYGTNRQ AMVDALFGGR TPVLDGWVVP EHPLAAPPDQ VTRYPYNPDQ ARQLLDEAGY
TDPDGDGIRA SPDGATLTLQ LLTTQGSDVR RAIARRFQAD MRAIGVAIDI NEASPDEVFD
SDGPLYLRQF DLALFGWIAG PEPGGLQLWS CAAVPAESNN YRGENFAGWC FRDADRAVRT
ADTTLDPAER AEAYLRQQQL WTQELPAIPL FQRLSIVVAA PDVRGLAPDP LAPVTWNVAA
WKREK