Gene Rcas_2184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2184 
Symbol 
ID5539665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2805618 
End bp2806505 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content61% 
IMG OID640894318 
Productextracellular solute-binding protein 
Protein accessionYP_001432286 
Protein GI156742157 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0092646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAC AAATGCTACT TATCGTGACG CTGATCGCCG CCATCCTGGC GATTAGCGCC 
TGCGGCGGAG CGCCTGCCGC GCAGCCGACC CAACCCGCCG CGCAGCCGAC CCAACCCGCC
GCGCAGCCGA CCCAACCCGC CGCGCAGCCG ACCCAACCCG CCACGCAACC AGAGGGGAAA
CTGGCGCAGA TCCGGGCAGC CGGCAAACTC ATCGTCGGCA CGTCGGCAGA CTACCCGCCC
TACGAGTCGA TCGACGCCAA TGGCAACTTC GTCGGTTTCG ACATGGACCT CATCCGCGCT
GTCGGCGAAA AACTCGGCGT CCAGGTCGAG ATTCGCGATA TGCCGTTCGA CTCGTTGATC
GCATCGCTCC AGGAAGGCAA GATCGATGCC GTGATCGCCG CCATGCAGGC GACCGCCGAG
CGTGAAGAGA AGGTCGATTT CACCATTCCT TACCGCATGA CGAAAGATGC ATTTATCGGC
GCCGGCGATA CAACAATTGT CATGAGCAAA CCGGAGGATG CGGCGGGAAT GACCATCGGC
GCACAGACCG GTACGGTTCA GGAAGGGTGG ATTCAGAAGA ACCTGGTGGC CACCGGATTA
ACGCCTGCCG ATAAGGTCTT CAGCTATGAG CGCGCCGATC AGGCAGCGCT CGACCTTGCC
AGTGGACGGC TCCAACTGGT GCTGATGGAC GCCGAACCCG CGTTGGAACT TGCCCAAAAG
AATAATCTGA AGGTGCTGCT CGTCACTGAG ACAACCGCCG AGGGCGGCAA GAGCATCGCC
ATCCCTGAAG GCGCCGGTGA CCTCAAGGCG GAACTGGATC GGATCATTCA GGGATTGATC
GATGACGGCA CTGTGAAAGC GCTCGAAGAA AAGCACGGAC TGCCATAA
 
Protein sequence
MKKQMLLIVT LIAAILAISA CGGAPAAQPT QPAAQPTQPA AQPTQPAAQP TQPATQPEGK 
LAQIRAAGKL IVGTSADYPP YESIDANGNF VGFDMDLIRA VGEKLGVQVE IRDMPFDSLI
ASLQEGKIDA VIAAMQATAE REEKVDFTIP YRMTKDAFIG AGDTTIVMSK PEDAAGMTIG
AQTGTVQEGW IQKNLVATGL TPADKVFSYE RADQAALDLA SGRLQLVLMD AEPALELAQK
NNLKVLLVTE TTAEGGKSIA IPEGAGDLKA ELDRIIQGLI DDGTVKALEE KHGLP