Gene Rcas_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1041 
Symbol 
ID5538507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1357692 
End bp1358978 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content67% 
IMG OID640893178 
Productmajor facilitator transporter 
Protein accessionYP_001431161 
Protein GI156741032 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTGGC TGTACGGCGA AATGGCTTCC GCGATGCCCA AACCACGACT ATCCATCTGG 
CGTCATCGTG ATTTTCTCCT TCTCTGGAGC GCCACGGCGG TCAGTCAATT GGGGACGCAG
ATCACCTTCC TCGCTCTGCC GTTCATTGCC GTCACTCTGC TGGACGCCTC GCCGCTGGAC
ACCAGCATCC TGGCAATGCT CGGATGGGCG CCGATGATTA CGCTTGGACT GGTCGCCGGC
GCCATTGTGG ACCGTATGCG CCGCCAGCCG CTCCTCATTG GGTGTGATGT GGCGCGGGCG
CTGGCGGTCG CCGCCATTCC CATCACGTAC CTAGCCGGCT GGCTCTCTCT CTGGCATCTC
TATGCGACCG TCCTGATCAC CGGGCTGTTC AGCACGTTGT TCGACCTGGC CTATCAGGCG
CGGCTGCCGA CCCTGGTTGC GCGCGACGAC CTGATCGCCG CCAATAGCGG GCTGGAACTG
GCGCAGTCCG GCACACGCAT CATCGGTCCC GGACTGACCG GCGCGCTGAT CGCCGTCTTC
ACTGCGCCGG TGGCGATCTT GTTCGATGCA CTGAGCTATC TGGCGTCGGC GCTGCTCCTC
CTGGGAATCC GCCAGCCGGA ACCGCCGCTC GTCGCGGCGC CCCGCGCGGG ATCGGTGACA
CACGTGCTAC GCCGGGAGAT GCGTGAAGGC ATGGTCAGTC TCTGGCGTCA ACCGTTGCTG
CGCACGCTGC TCGGCGCCAC ATTGGGCTTG AGCATCGGCT GGGCGCTGGT GGAGGGAATT
CTCATGTTCT ACATCGTGCG CACGCTAGCG CTGCCAGCGG AGGCGGCTGG CGCGGTCTTC
AGCATTGGAA ACATCGGGTT GCTGATCGCC GCCGCCCTCG CCAGCCAGGT GACGCGACGC
TGGGGCTTGG GTCCGGTCAT TGTCGGCGCG GCCGGGTTGC AGACGCTCGG ACTAGCGTTG
CTGGCGCTGG CGCCGCTGGC GCCGCTGGCA CTGCTCACCA CCGGGTATCT GATGCGGGCC
GCAGGAGTCG TGCTGTACAA CCTGAGCCAT CTGACCCTGC GCCAAAGCAT CACGCCGCTG
CACCTGCTGG GACGGGTGAG CGCCGCTGTG CGCGTCCTGG GGTGGGCTAG CATTCCGGTC
GGGCTGGTCG CTGGCGGCTG GCTGGCGACG CTGAGCGGGC CGTCAGTCGC AATCTGGAGC
GGCGCCGCGT TCAGCGCGCT GGCGCTTGTG CCGCTGGCGC TAGGGCGGAT CTGGCAGGTG
CGCACTGCGC CAGCCCCCGC CCCCTGA
 
Protein sequence
MHWLYGEMAS AMPKPRLSIW RHRDFLLLWS ATAVSQLGTQ ITFLALPFIA VTLLDASPLD 
TSILAMLGWA PMITLGLVAG AIVDRMRRQP LLIGCDVARA LAVAAIPITY LAGWLSLWHL
YATVLITGLF STLFDLAYQA RLPTLVARDD LIAANSGLEL AQSGTRIIGP GLTGALIAVF
TAPVAILFDA LSYLASALLL LGIRQPEPPL VAAPRAGSVT HVLRREMREG MVSLWRQPLL
RTLLGATLGL SIGWALVEGI LMFYIVRTLA LPAEAAGAVF SIGNIGLLIA AALASQVTRR
WGLGPVIVGA AGLQTLGLAL LALAPLAPLA LLTTGYLMRA AGVVLYNLSH LTLRQSITPL
HLLGRVSAAV RVLGWASIPV GLVAGGWLAT LSGPSVAIWS GAAFSALALV PLALGRIWQV
RTAPAPAP