Gene RoseRS_4010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4010 
Symbol 
ID5210993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5017108 
End bp5018319 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content62% 
IMG OID640597599 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001278305 
Protein GI148658100 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000079745 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAC GAGTGTTTCT TCGCAGCGTC GCGGCGGGCA GCGCAGCCCT GACAGCAGCC 
ACACTGGCAG CGTGTGGTCA GGCGCCGCAG ACGCCAGTTC AGCAGGCGAC GACCGCCCCG
GCACAGCAGG CGACGACCGC CCCGGTACAG CAGGCAACGC AGGCGCCGGT CGCCACTGCG
CCACAACCGC AGGCGCCAGC GCAAACGAGC GAGATGCCCT CTCTTGAGTG GGATATGGCT
ACCAGCTGGC CCGTGGCGCT CGACACGATT TTCGGCGGAG CGAAGACAGT TGCTGACCGT
GTGGCGGCAT TGACAGACGG TAAGTTTAAA ATCACGCCAC GCGCTGCGGG CGAACTGGCG
CCTGCCTTGC AGGTGCTTGA TGTGGTGCAG CAGGATGCCG TGCCGATCGG TCACACCGCA
TCGTACTACT ATGTCGGCAA GAGTCCGGTG ACCGCGTTTG GCACTACGGT GCCCTTCGGT
CTCAACGCAC AGCAGCAAAA TGCCTGGTTG TACGACGGCG GCGGGCTGGA AAAATTGCAG
GCGGTGTACG CCAAACTGTT CAATGTTATT CAGTTCCCGG CGGGCAATAC CGGCGTCCAG
ATGGGTGGGT GGTTCCGCAA GGAGATCAAC ACCGTCGCCG ACCTTCAGGG TCTCAAGATG
CGCATCCCCG GTCTCGGCGG GCAGGTGTTG ACCAAACTGG GAGTCACCGT TCAGGTCATT
CCGGGTGGTG AGATCTTCCA GGCGTTGCAG ACCGGCGCGG TCGACGCGGC GGAATGGGTC
GGGCCGTATG ACGATGAGAA ACTCGGACTG AACAAGGCGG CGAAGTTCTA CTACTATCCG
GGCTGGTGGG AGCCGGGTCC TACACTCGAG GTGCAGGTCA ACCTCGACAG GTGGAACGAA
CTGCCAAAAG TCTACCAGGA GGCGATTAAG ACCGCATCCG CCGAGGCGAA TATCACGATG
CTGGCGCGGT ACGATGCGCG CAACCGTGAA GCGCTCAAGC GCCTGGTGGA CGGCGGCGCG
CAACTGCGCC CGTACAGCAA GGAAATCCTT GCCGCAGCCG AGAAAGCCGC CTTCGAACTG
TACGATGAGT TCGCCGCGAA AGACGCCGAC TTCAAGGAAA TCTACGAGGA GTGGAAGGCG
TTCCGCGAGG CCATCTATGA GTGGAACAAG GTGAACGAAG CCGGGTACAC CAACTACGCC
TACAATAAGT GA
 
Protein sequence
MRRRVFLRSV AAGSAALTAA TLAACGQAPQ TPVQQATTAP AQQATTAPVQ QATQAPVATA 
PQPQAPAQTS EMPSLEWDMA TSWPVALDTI FGGAKTVADR VAALTDGKFK ITPRAAGELA
PALQVLDVVQ QDAVPIGHTA SYYYVGKSPV TAFGTTVPFG LNAQQQNAWL YDGGGLEKLQ
AVYAKLFNVI QFPAGNTGVQ MGGWFRKEIN TVADLQGLKM RIPGLGGQVL TKLGVTVQVI
PGGEIFQALQ TGAVDAAEWV GPYDDEKLGL NKAAKFYYYP GWWEPGPTLE VQVNLDRWNE
LPKVYQEAIK TASAEANITM LARYDARNRE ALKRLVDGGA QLRPYSKEIL AAAEKAAFEL
YDEFAAKDAD FKEIYEEWKA FREAIYEWNK VNEAGYTNYA YNK