Gene Rcas_2454 details


Gene Information

Locus tag: Rcas_2454
Symbol:
ID: 5539935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3152757
End bp: 3154805
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 59%
IMG OID: 640894584
Product: extracellular solute-binding protein
Protein accession: YP_001432552
Protein GI: 156742423
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
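
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; both assumptions are inferred from the listed values rather than stated in the record.

```python
# Values copied from the record above.
start_bp = 3152757
end_bp = 3154805

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2049
print(protein_length)  # 682
```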


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.39331
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAA ACGCCAGGCG ATCAATCAGT CGCCGCAATT TTCTGCGCCT GTCGGCTGTC 
CTCGGCGCAA GCGCCGGCCT TGCCGCCTGC GGCGGCGCTC CGACCGCTCC CGCTCCCACT
GCCGCTCCAT CCGCTCCAAC TGCGCCACCG GCGCCAACGG TCGCTCCTGT TGCCACTGCT
GCTCCGCCAC AGGCAGCAAC CGCCGTGCCT GTAGCGGTGT CGCAGTTCAA CGAAGCGCCC
GTCCTGGCGG AACTGGTCCA ACAGGGCAAA CTGCCGCCGG TCGATCAGCG CCTGCCCAAG
AATCCAGTGG TGCTGACCGG TCTCGATGGC GTTGGCAAAT ATGGCGGAAC CATCCGTCGC
GGCTTCCGCG GCTTTTCGGA CCGCTGGGGA CCGACCAAGA TTCAGAACGA GGGGCTGACC
TGGTATAACC CTGACTTAAG CCTGCGCGCG AACATCGTCG AGTCGTGGGA GGTCAGCAGC
GATGCGCGGC AGTGGACGTT CAAGCTACGC GAGGGGATGA AGTGGTCCGA TGGTTCGGAC
TTCACGACCG ACGACTGGAA ATGGTGGTAC GAGAACGTCC TGCTCAACGA TCAGATCACG
ACGGCGCTTC CCGGTGTTTG GTCAACCGGC TCACCGCGCG TGGCGATGAA AGCGGAGTTT
CCCGATCGCT ACACGGCGGT GTTCACGTTC CAGGACCCCA AGCCACTCTT CGCCTACAAC
GTCACCCGCG AGCAACCGTT CGTTCATGCC GGTTTCATGA AGCAGTTCCA CGAAGGCTTC
GCCGATAAAG CCAAACTTGA GGCGGATGCG AAGGCGGCAG GGTTCGAGAC GTGGGTGCAA
CTGTTCAACG ACAAAAATTG GTGGTGGACG GTCGGGCGTC CATCGCTCGG TCCGTGGGTA
GCCGTCAACA CCATGACCGA GCAGTTGTTC ATTATGGAGC GCAACCCTTA CTTCTGGCAG
GTGGATGAAG ATGGCAATCA GTTGCCGTAC ATCGACAAGA TCACCCACCT GCTGTTCGAG
ACGCCCGATG CCTTCAACAC GCGCATCATT GCGGGCGAGG TCGATTTTCA GGCGCGCCAC
GTGAGCATTG GCGACTTCAC ACTCCTCAAA GAGAATGAGA GTCGCGGCGA TTATCAGGTA
GTGCTGGGCG TCAGCGCCAA CCATATCGCT TTCCAGCCGA ATCACACGGC GAAGAACCCG
AAGATCCGCG AGTTCTTCCA GAACCGCGAC GTGCGCATTG CACTCTCATT GGCGATCAAC
CGCGACGAAA TCAACGAACT GGTCTTCAAT GGCACCGCTG TGCCGCGCCA GTACAGCCCG
TTGAGCATGT CGCCGCAGTA CTACGAGAAA CTGACGAAAG CGTATATCGA GTACGATCCG
GCGCGCGCCA ATCAACTGCT CGATCAGGCC GGCTACGACA AGAAGGATAC TCAGGGCTTC
CGCCTCTGGA AGGATGGCAG CGGTCCGATC AGTTTCATCA TTGAGGGCAC GGCGCAGGCG
GGTTCACCCG ATGAAGACGC CGTCCAGACG ATGATCAAGT ATTTTGCCGA TGTCGGCATT
AAGGCGACGT ACAAAGCGCA GGAACGTTCG CTGTACACCG AACGCTATCA GGCGAACGAG
GTCGAGGCGG CGTTCTGGGG TGGTGACCGG ACGCTGTTAC CCATCGTGGC GCCCTGGATT
TTCCTGGGGA GCATGATCGA CCGTCCCTGG GCTGCCGCGT GGGGAATCTA CTACAACAAC
CCGAATGACC CCAACGCTGA ACAACCGCCG GAGGGTCATT TCATCACCAA AATCTGGGAT
ATTCAGAAGC AGATCGATGT CGAGCCGGAC GAGGAGAAGC GCAATGCGCT CTTCCGTCAG
ATTCTCGACA TCTGGGCGGA GGAGTTGCCG ATGATCGGCA TCCTCGGTGA ACTGCCATCA
CCGACGATTG TGAAGAACGG CTTCAAAGGC TTCCGCGCCG GATTCCCCAA CGACGATACA
ACCGAGGACG AGAACCTGCT CAACACGCAG ACCTACTACT GGGATGATCC GGCAAAACAC
ACCAATTAG
 
Protein sequence
MNKNARRSIS RRNFLRLSAV LGASAGLAAC GGAPTAPAPT AAPSAPTAPP APTVAPVATA 
APPQAATAVP VAVSQFNEAP VLAELVQQGK LPPVDQRLPK NPVVLTGLDG VGKYGGTIRR
GFRGFSDRWG PTKIQNEGLT WYNPDLSLRA NIVESWEVSS DARQWTFKLR EGMKWSDGSD
FTTDDWKWWY ENVLLNDQIT TALPGVWSTG SPRVAMKAEF PDRYTAVFTF QDPKPLFAYN
VTREQPFVHA GFMKQFHEGF ADKAKLEADA KAAGFETWVQ LFNDKNWWWT VGRPSLGPWV
AVNTMTEQLF IMERNPYFWQ VDEDGNQLPY IDKITHLLFE TPDAFNTRII AGEVDFQARH
VSIGDFTLLK ENESRGDYQV VLGVSANHIA FQPNHTAKNP KIREFFQNRD VRIALSLAIN
RDEINELVFN GTAVPRQYSP LSMSPQYYEK LTKAYIEYDP ARANQLLDQA GYDKKDTQGF
RLWKDGSGPI SFIIEGTAQA GSPDEDAVQT MIKYFADVGI KATYKAQERS LYTERYQANE
VEAAFWGGDR TLLPIVAPWI FLGSMIDRPW AAAWGIYYNN PNDPNAEQPP EGHFITKIWD
IQKQIDVEPD EEKRNALFRQ ILDIWAEELP MIGILGELPS PTIVKNGFKG FRAGFPNDDT
TEDENLLNTQ TYYWDDPAKH TN
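
Both sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before computing lengths or base composition. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the sample input is only the first line of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in 10-character blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAACAAAA ACGCCAGGCG ATCAATCAGT CGCCGCAATT TTCTGCGCCT GTCGGCTGTC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this sample line only
```

Applying the same two helpers to the full gene sequence should give a length that matches the listed gene length and a GC fraction comparable to the listed GC content.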