Gene Rcas_2110 details

Gene Information

Locus tag: Rcas_2110
Symbol:
ID: 5539590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2710683
End bp: 2712890
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 59%
IMG OID: 640894244
Product: extracellular solute-binding protein
Protein accession: YP_001432213
Protein GI: 156742084
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
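
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions that happen to agree with the listed values, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information record above.
START_BP = 2710683
END_BP = 2712890
LISTED_GENE_LENGTH_BP = 2208
LISTED_PROTEIN_LENGTH_AA = 735


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == LISTED_GENE_LENGTH_BP)        # True
print(computed_protein_length == LISTED_PROTEIN_LENGTH_AA)  # True
```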


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.233193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGTGT CCAAGTCACG CATCCTGAGT CGTCGCAGAT TTCTGACACT TTCAGCCATG 
ACCGCTGCCA GCGCCGCGAT TGCTGCTTGC GGCGGCGGGC AACCCGCGCA AGCGCCAACT
ACGGCGCCGG CGCCAACGGC AGCGCCAACC ACGGCGCAAG CGCCAACCGT CGCTATTCCG
CCAACTACGC CGCCGCAGGC GACCGCTGTT CCCCAGACGG TCAGCAAGTA TAAGGAGGCG
CCAGAACTGG CAAAGCTGGT TGCCGAAGGC AAATTGCCGC CCGTTGACGA GCGACTGCCG
AAAAACCCGT ATGTCGTACC GCACAAGTGG CTCACCGTCG GCAAGTATGG CGGAACGATC
AACTTTACCA ACTCTTGGGG ACCCGACGGT ATGGCGACGA TTGTGCAGGA GAGTCAGTAT
GGTCACTCAA TCCTGCGCTG GCTCGATGAT GGCCTGAAGA TCGGTCCGGG TCTTGCCGAG
AGTTGGGAAG CGAACGCCGA TGCCAGCGAG TGGACGTTCA AGTTCCGCGA AGGGCTGAAA
TGGTCGGATG GGCATCCCTG GACAGTTGAT GACATTCTCT ACTGGTGGGA GTACATGGTC
GGCGGCAACG GCAAGGAGAA GGAGTTCCCC GAAGGGCTGA AACCGATCGA GCCGCCGCCG
GACGAAGGGC GCTCCGGTAC GGGAACGCTG GCGACGCTGA TCAAAGTCGA TGACTATACG
CTGACGATGA AGTTCGACGC CCCCGCACCG CTGACGGCGG ATCGTCTGGC GATGTGGGTG
AATGCCGGCA TTGGGCCGCG CTGGATGGCG CCGCGTCACC ATATGGAGCA GTTTAACCCC
GTCCTCAATC CCGATAAGTA CAAAGACTGG GATGAGCACA CCAAGCGCAT TCGCTTCGTC
ACCAACCCCG ACTGCCCGAC GATGACCGGT TGGAAGTGTG AGAGTTTCGA GGAGGGTGTG
CGCGGTGTCT ATACGCGCAA TCCGTACTAC TGGTGCGTGG ACGCTGAGGG GAATCAGTTG
CCCTATATCG ACCGCATTAT CTCGACCACC TTCCAGAACA GCGAGGTCGA AAAACTGAAC
GTGATCCAGG GCAAGAGCGA CTTTACCCAT CACTGGGTGC TCAGCCTCGA TGATGTGCCC
AACCTGCGCC AGAATGCCGA TGCTGGCGGG TATGAGGTGC GCTTCTGGGA GAGCGGGTCG
GGTTCGGGCA CGTCGTTCTT CTTCAGTTAC GACTACAACG ATCCAAAGTG GCGCGAACTG
ATCCGCAACC CGAAGTTCCG TCAGGCGTTG TCGCTGGCGT TCAACCGCGC CGAATTGCAG
AAGACGCAAT ACTTCGGCAC CGGTGAGTTG ACGACCGGCA CCTTCAGCCC GAAAGCGATC
GAGTACAATA TCGACGAGAA AGGCAATCCG GATCGGGCCA ATGCACGCTA CCGCGAATGG
CGCGATAGTT ATGTGGCGTT CGATCAGGAG CGGGCGAAGG CGTTGCTCGA CGAAATTGGC
GTCAAAGTTG GCGCTGACGG GTTCCGCACC TTCCCGGACG GCTCGCCGCT CGATATTCAG
GTGATCGTCG CATCGAACAC GAGTAAGAAC ACTATCGCTC AAAACGAGCA GATGATCCGC
GACTGGGGGC AGATCGGCAT CAAGGCGACG CTTACTCCGG TGCCGCCGCA GGGTCGCCGC
GATGACTGGT TCGCGGGTAA GTTGATGTCG AATGCCGATT GGGGTATTGG CGACGGACCG
AACCATCTGG TCTACCCGCA GTGGTTGGTG CCAATCGAAC CGGAACGCTG GGCGCCCCTC
CACGGTCAGG GGTATCAGTT GCGCGGCACT GCCAGTGAAA AAGAGGAACT GGACAAAGAT
CCCTGGCAGC GCGCGCCGGC TCGTATTGTG CCGTCCGACA AGGACTTCGA TCCGGTTATT
GCCAAACTGC ATGAAATCTA CGACAAGACG AAGGTCGAGC CGGAGTTTCT CAAGCGCACC
CAGATGGTCT GGGAAATGAT GAAGATCCAT GTCGAGAACG GACCGTTCGT ACAGGGGTCG
GTCGCCAACT TCGACCGCGT GTTCATCGTG AAGAAGGGGT TGATGAACGT GCCGAAGAAG
GAAGACCTGG CGCTTGGCGG ATTTACCGAT CCCTGGATCC ATCCCACGCC GGCGGTCTAT
GACCCTGAAG CCTGGTATTG GGATGATCCG TCAAAGCATA CAAGTTGA
 
Protein sequence
MPVSKSRILS RRRFLTLSAM TAASAAIAAC GGGQPAQAPT TAPAPTAAPT TAQAPTVAIP 
PTTPPQATAV PQTVSKYKEA PELAKLVAEG KLPPVDERLP KNPYVVPHKW LTVGKYGGTI
NFTNSWGPDG MATIVQESQY GHSILRWLDD GLKIGPGLAE SWEANADASE WTFKFREGLK
WSDGHPWTVD DILYWWEYMV GGNGKEKEFP EGLKPIEPPP DEGRSGTGTL ATLIKVDDYT
LTMKFDAPAP LTADRLAMWV NAGIGPRWMA PRHHMEQFNP VLNPDKYKDW DEHTKRIRFV
TNPDCPTMTG WKCESFEEGV RGVYTRNPYY WCVDAEGNQL PYIDRIISTT FQNSEVEKLN
VIQGKSDFTH HWVLSLDDVP NLRQNADAGG YEVRFWESGS GSGTSFFFSY DYNDPKWREL
IRNPKFRQAL SLAFNRAELQ KTQYFGTGEL TTGTFSPKAI EYNIDEKGNP DRANARYREW
RDSYVAFDQE RAKALLDEIG VKVGADGFRT FPDGSPLDIQ VIVASNTSKN TIAQNEQMIR
DWGQIGIKAT LTPVPPQGRR DDWFAGKLMS NADWGIGDGP NHLVYPQWLV PIEPERWAPL
HGQGYQLRGT ASEKEELDKD PWQRAPARIV PSDKDFDPVI AKLHEIYDKT KVEPEFLKRT
QMVWEMMKIH VENGPFVQGS VANFDRVFIV KKGLMNVPKK EDLALGGFTD PWIHPTPAVY
DPEAWYWDDP SKHTS
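
The relationship between the two sequences above can be illustrated with a small translation sketch. It assumes NCBI translation table 11 (the value listed in the record), under which GTG is an accepted start codon and is read as methionine at the first position. The codon table is built by hand from the standard NCBI base ordering rather than taken from a library, and `translate_cds` is a hypothetical helper name. Only the first 60 bases of the gene sequence are used here.

```python
from itertools import product

# NCBI ordering of codons: each position cycles through T, C, A, G.
BASES = "TCAG"
# Amino acid assignments for translation table 11 (same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; any of them is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
FIRST_LINE = "GTGCCTGTGT CCAAGTCACG CATCCTGAGT CGTCGCAGAT TTCTGACACT TTCAGCCATG"
print(translate_cds(FIRST_LINE))  # MPVSKSRILSRRRFLTLSAM
```

The result matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, since the final codon TGA is a stop codon in this table.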