Gene Rcas_1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1578 
Symbol 
ID5539054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2027368 
End bp2029155 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content63% 
IMG OID640893716 
Productextracellular solute-binding protein 
Protein accessionYP_001431689 
Protein GI156741560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.259952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCA GAATCTTCCC CGTGAGTAAT TCGCATCTAT CGCCAGTCCA TTCCACTCCT 
GTGCTCCGGT TGCCGTTGCG CCGATTGTTT GGCGTCAGCA CATTTGTTGC AATTGCGTTG
CTGGGGGTGC TTGTCGCCTG TGGTGCGCCG GAACAACCTG CCACACCCGC GGGCGTCTCG
CCGATGCCTG CGCCAACCAC AGCGCCGCCG ACTGCGACAG CGCTGCCGCG CGGCGGCAAT
CTTACAATCC GGTTGGCTGC CGATGTTGCT GAGTTGCGCC CCTGGCATCC ACGTACACGT
GGCGAGGAAC AGCTCATCGC GCTGCTCTAC AGCGGTCTGA CCCGCCTGGA CGGCACGCTG
GCGCCGCAGC CGGATCTGGC GGCAGGCTGG ACGGCTTCTG CCGATGGGCG GACAATTACA
TTGACGTTGC GCCACGATGC GGTCTGGCAC GATGGACAGC CGGTGACGGC GGATGATGTG
GTCTTTACTC TCAACGAATT GCGCGCACTC GAACCAACCA CGGCGTTGCT CGCCGGGTTA
CGGCGTATGA CGGAGGTTAC AGCGCCGGCA ACCGATACCG TCGTGGTGCG TCTCGATGAA
CGCTACGCGC CGATCTTTAG CCTGTTGACG GCGCCGGTGT TGCCGCGCCA TGCCCTGAGC
GGCAGAAGTT TGGCGGATCT CAACGCCTGG GAAGCGCCGG TCGGCAGCGG TCCATTTCGC
CTGGAGCGGC GTGAGCCGGG CACGGCGATC ACGCTGGCGG CCAATCAATC GTTTTACCGG
GGCGCGCCGT TGCTCGACCG GGTGGTGTTC GTGGTGGCGC CCGATGCGCA GGTGGCGGCG
TCCGCGCTCC AGAATGGGCA GTTGCTCTTG GCGGAACTGC CCTGGAGCGA AGGGCGCGCG
CTCACCGAAA CAATGCCAAT GCTCCAAACC GGCGCATATG CCGAGAATGG CTACTACTTC
CTGGCATTCA ACCTGCGCCC CAACCGTATC TTCAGTGATC TGCGGCTGCG TGAGGCGCTG
GCGCTCACTA TCGATCTGCC GCGCATGATC CGGGAAGCGA CTAACGGGCA GGGCATGATC
ATCGGGAACA GCGCCGCTCC TGGCTCCTGG GCAGACCTGA CGCCGCCGTC AACGACGACC
GTGGACCTGG ATCGCGCGCG CGCGCTGCTC GATGAGGCAG GGTGGCGGCT CCCGTCGGAT
GGCGTGGTGC GGCAAAAAGA TGGTGTGACG CTCACCGCGC AACTCTTTGT GCGCGCTGAC
GATCCGCGTC GGGTGCGCGC TGCCGAACTG ATTGCTGGCG CCGCCGAGCA GATCGGGATG
GACATTGTGG TGCAACCCGC TGACTTCGCA ACGGTGATTC GCTCGAAGTA TGCGCCACCC
TATGATTTCG ACATGCTCCT CGGCAGCTGG ATCAATGGCG TCGCCGACCC GACTTTCGGT
GACTACGCCT ACTACGATCC AGACGATTTT GCGCTGTTTC ATTCGAGTCA GATCAACCAG
GGGGTGGCGG ATACGCGCCC TGTGTTGAAC TTTGTCGGGT TTAGCGACCC GGTGTACGAT
GATCAGGCAG GCGCAGCGCG GCAATTGTAC GATCTGACAG AACGGGCGCA GGCAATCCGG
CGGGCGCAGG AACGAGTGGC GCTGCTGCGT CCCTACCTGT TCCTGTGGAC GGATCGGTTG
CCGGTGGCAT GCAGCGCGCG CCTGACAACG CTGGACGGAC CGATTAACCT GGCGACGCCG
AACTATCTGT GGAATATCGA ACGGTGGTAT GTCACAGGTG AGGGTTGA
 
Protein sequence
MSSRIFPVSN SHLSPVHSTP VLRLPLRRLF GVSTFVAIAL LGVLVACGAP EQPATPAGVS 
PMPAPTTAPP TATALPRGGN LTIRLAADVA ELRPWHPRTR GEEQLIALLY SGLTRLDGTL
APQPDLAAGW TASADGRTIT LTLRHDAVWH DGQPVTADDV VFTLNELRAL EPTTALLAGL
RRMTEVTAPA TDTVVVRLDE RYAPIFSLLT APVLPRHALS GRSLADLNAW EAPVGSGPFR
LERREPGTAI TLAANQSFYR GAPLLDRVVF VVAPDAQVAA SALQNGQLLL AELPWSEGRA
LTETMPMLQT GAYAENGYYF LAFNLRPNRI FSDLRLREAL ALTIDLPRMI REATNGQGMI
IGNSAAPGSW ADLTPPSTTT VDLDRARALL DEAGWRLPSD GVVRQKDGVT LTAQLFVRAD
DPRRVRAAEL IAGAAEQIGM DIVVQPADFA TVIRSKYAPP YDFDMLLGSW INGVADPTFG
DYAYYDPDDF ALFHSSQINQ GVADTRPVLN FVGFSDPVYD DQAGAARQLY DLTERAQAIR
RAQERVALLR PYLFLWTDRL PVACSARLTT LDGPINLATP NYLWNIERWY VTGEG