Gene Rcas_2647 details

Gene Information

Locus tag: Rcas_2647
Symbol:
ID: 5540129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3412374
End bp: 3413948
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 61%
IMG OID: 640894770
Product: major facilitator transporter
Protein accession: YP_001432737
Protein GI: 156742608
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
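
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only a consistency check, not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that adds no residue to the protein. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3412374
end_bp = 3413948
protein_length_aa = 524

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The final codon is
# assumed to be the stop codon, which does not encode an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1575
print(remainder)                # 0
print(expected_protein_length)  # 524
```

Under these assumptions the computed values agree with the listed gene length of 1575 bp and protein length of 524 aa.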


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTGC ATCCGGCGTC ATTGCGAACG ATAGTGACGC AATCTCTCCT CGCTCAAGAA 
ATCGTCTCGC TGCGGATGCG CCTCGCAATG ACAATATGGC AAAGACGCGC CTTCTTTCAT
CCAAACTCAT ATGAGCGAGG CTTCGGATTG GTGCGGGCGC GACGCCCACG CTCCCAGTGC
TCCCAGGACC GGTGCGGGCG CAACGCCTGC GTGGGTGCGC CGTGCCGCCT GCCACACCAG
CGCCGCGTTA TGGATGCATC TTCCGCCTGC AAGATGTGGT ATGATAACCG TTCCTGCACC
TTCTCTGAAC CGACACAGAA ACCTATGCAC ACATCAACGC CTTCGCCTCG TTTTGCTGCA
CTGACCGCGC TGCGGTACCG CGACTTCCGG CTGCTCTGGG CTGGTCAGTT TGTGTCGATC
ACCGGCACGC AGATGCGCAA TGTCGCCATT GCATGGCAGG TGTACCGGCT GGCACAGGCG
GACAGCAGCA TTCGGGTTGA AATCGCGCTT GGACTGATCG GTCTGGCGCG CGTTATTCCG
CTGATCCTGA CGGCGATGTT CAGCGGTATG ATCGCCGACC GCGTCGAGCG GCGCAAAATT
CTGATTCTGA CCTCGCTCGT GGCGTTTGTG TGTTCAATAG TGCTGGCGCT TACCGGCGAG
ATGGAGCGCC CGCCGTTGCT GCTCATCTAC ACGATGGTGG CGCTGGCATC GGTCGCCGGC
GCCTTCGAAC TGCCGGCGCG TCAGGCAATC ATTCCCAACC TCGTTGCGCC ACAGCACTTG
CCAAATGCGT TGAGTCTGAA CATCGTCGCC TGGCAACTTG CGACGGTTAT TGGTCCGGCG
TTGTCAGGCG TCTTGATTGC TGCGGTCGGT GTTGCGCCAG TGTACTGGAT CGATGCCGCG
ACCTTTCTGG CAGTCGTTGC GGCAGCGTTG CTCATGCGCA CGCGCACCAT TCCGGCGCGC
ATCGAACCGG TGTCGCTCCG GGCGGCGCTG GCGGGGCTGC GTTTCGTCTT TTCACATCGC
CTGATTGCTG CAACCATGCT GCTTGATTTC TTCGCTACAT TTTTTGGCGC TACTGGAGTG
CTGCTACCGA TCTTTGCCGA TCAGGTCTTG CGGGTCGGAC CGACCGAACT GGGCTGGATG
TACGCCGCGC CATCGGTGGG AGCAGTGGTC GCTGCAACCC TGCTCAGCGG TGTGCGCATT
CCACGACAGG GGACGACGCT GCTCGCGGCT GTGCTGGCGT TTGGCGCATG CGTCGCAGTG
ATCGGGATGT CGCGTTGGCT CCCGCTAACG CTGGCAGCGC TGGCAGGCAT GGGTGCAGCG
GATACCGTCA GTATGGTCAT TCGCGGCGCG ATCCGTCAAT TGCTCACTCC CGATGAATTA
CGCGGGCGCA TGGTGGCGGT CAATATGGTC TTCTTTGCCG GCGGCCCGCA ACTTGGAGAA
ACCAGCGCCG GTTTTATCGC CAGCCTGATC GGTGCGTCTG CGGCGGTAAC CCTCGGCGGC
GTGGCGTGTA TCCTCCTGGT TGTTGGAACA GCGCTCAGCG TGCGTGAACT GCGCGAGTAC
CAGGGACCGG GCTGA
 
Protein sequence
MTLHPASLRT IVTQSLLAQE IVSLRMRLAM TIWQRRAFFH PNSYERGFGL VRARRPRSQC 
SQDRCGRNAC VGAPCRLPHQ RRVMDASSAC KMWYDNRSCT FSEPTQKPMH TSTPSPRFAA
LTALRYRDFR LLWAGQFVSI TGTQMRNVAI AWQVYRLAQA DSSIRVEIAL GLIGLARVIP
LILTAMFSGM IADRVERRKI LILTSLVAFV CSIVLALTGE MERPPLLLIY TMVALASVAG
AFELPARQAI IPNLVAPQHL PNALSLNIVA WQLATVIGPA LSGVLIAAVG VAPVYWIDAA
TFLAVVAAAL LMRTRTIPAR IEPVSLRAAL AGLRFVFSHR LIAATMLLDF FATFFGATGV
LLPIFADQVL RVGPTELGWM YAAPSVGAVV AATLLSGVRI PRQGTTLLAA VLAFGACVAV
IGMSRWLPLT LAALAGMGAA DTVSMVIRGA IRQLLTPDEL RGRMVAVNMV FFAGGPQLGE
TSAGFIASLI GASAAVTLGG VACILLVVGT ALSVRELREY QGPG
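
The gene and protein sequences above are printed in space-separated groups of ten residues. As a minimal sketch of how they relate, the code below strips that grouping and translates DNA codon by codon. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not taken from any tool associated with this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 shares
# these codon-to-amino-acid assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGACTCTGC ATCCGGCGTC ATTGCGAACG ATAGTGACGC AATCTCTCCT CGCTCAAGAA"
last_line = "CAGGGACCGG GCTGA"

print(translate(first_line))  # MTLHPASLRTIVTQSLLAQE
print(translate(last_line))   # QGPG*
```

The first line translates to the first twenty residues of the listed protein, and the last line ends in a TGA stop codon after the final QGPG. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.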