Gene Rcas_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0066 
Symbol 
ID5537525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp81867 
End bp83078 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content61% 
IMG OID640892232 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001430222 
Protein GI156740093 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000558926 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00313159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACGAC GAGTGTTTCT TCGCAGCGTC GCGGCGGGCA GCGCAGCCCT GACGGCTGCT 
ACGCTGGCAG CGTGCGGCCA GGCTCCGCAG ACGCCAGCCC AGCAGGCGAC GACTGCCCCG
GCTCAACAGG CGACGACCGC CCCGGCTCAG CAGGCGACGA CCGCGCCGGT AGCGACTGCG
CCACAGGCGC AGGCGCCAGC CCAGACGAGC GAAATGCCAT CCATTGAGTG GGACATGGCC
ACCAGTTGGC CCGTTGCACT CGACACGATC TTTGGCGGTG CGCAAACGGT CGCTGATCGT
GTCGCAGCGA TGACCGACGG GAAGTTCAAA ATTACGCCGC GCGCCGCCGG TGAACTGGCG
CCCGGTCTCC AGGTGCTCGA TGTGGTGCAG CAAGATGCCG TTCCGATTGG CCATACCGCG
TCGTATTACT ATGTCGGCAA AAGCCCGGTC ACGGCATTCG GCACATCGCT GCCTTTTGGT
CTCAATGCAC AGCAGCAGAA TGCCTGGTTG TACGATGGCG GCGGTTTAGA GAAGTTGCAA
GCGGTGTACG CCAAACTGTT TGGCGTCATT CAGTTTCCGG CCGGCAACAC CGGCGTTCAA
ATGGGTGGCT GGTTCCGCAA GGAAATTAAC ACTGTCGCTG ATCTCCAGGG TCTCAAGATG
CGTATCCCCG GCCTCGGCGG GCAGGTGATG ACGAAACTCG GCGTCACGGT GCAGGTCATT
GCAGGCGGTG AGATCTTCCA GGCGCTCCAG ACGGGCGCTG TCGATGCGGC AGAATGGGTC
GGTCCGTATG ACGACGAGAA ACTCGGTCTG AACAAGGCAG CACAGTTCTA CTACTATCCG
GGTTGGTGGG AGCCGGGTCC TACGCTCGAA GTGCAGGTCA ATCTCAATCG CTGGAATGAG
TTGCCCAAGA CGTATCAGGA GGCGATCAAG ACCGCATCAG CCGAGGCGAA TATCACGATG
CTTGCGCGCT ACGACGCACG CAACCGCCAA GCCCTCAAGC GTCTGGTGGA CGGCGGTGTG
CAATTGCGTC CGTATAGCAA AGAAATCCTC GACGCTGCCG AGAAGGCCGC TTTTGAGCTG
TACGACGAGT TCGCCGCCAA GGATGCCGAT TTCAAGGCGA TCTACGAGGA ATGGAAAGCA
TTCCGCACGG CGATCTACGA GTGGAATAGG GTGAACGAGG CAGGGTTCAC CAACTACGTC
TACAGCAAGT AG
 
Protein sequence
MRRRVFLRSV AAGSAALTAA TLAACGQAPQ TPAQQATTAP AQQATTAPAQ QATTAPVATA 
PQAQAPAQTS EMPSIEWDMA TSWPVALDTI FGGAQTVADR VAAMTDGKFK ITPRAAGELA
PGLQVLDVVQ QDAVPIGHTA SYYYVGKSPV TAFGTSLPFG LNAQQQNAWL YDGGGLEKLQ
AVYAKLFGVI QFPAGNTGVQ MGGWFRKEIN TVADLQGLKM RIPGLGGQVM TKLGVTVQVI
AGGEIFQALQ TGAVDAAEWV GPYDDEKLGL NKAAQFYYYP GWWEPGPTLE VQVNLNRWNE
LPKTYQEAIK TASAEANITM LARYDARNRQ ALKRLVDGGV QLRPYSKEIL DAAEKAAFEL
YDEFAAKDAD FKAIYEEWKA FRTAIYEWNR VNEAGFTNYV YSK