Gene Rcas_1534 details

Gene Information

Locus tag: Rcas_1534
Symbol:
ID: 5539010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1956805
End bp: 1958178
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 62%
IMG OID: 640893672
Product: PUCC protein
Protein accession: YP_001431645
Protein GI: 156741516
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
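
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions about the database's conventions, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1956805
end_bp = 1958178
gene_length_bp = 1374
protein_length_aa = 457

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)
print(expected_protein_aa, expected_protein_aa == protein_length_aa)
```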


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00123663
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.002835
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATCTGC TGCGCACAAT CGGTCGGTTC TTGCTGGGCG CGCTGAAGGT GTTGCGTCTG 
GCGCTGCCAA AACTGGGCGT TGGCTGGATG TTTGCCCTGC TCACCAGCAA CTTCAATCGC
GTGTCGATCG TTGAACTGGG GGTTATGGCC GTCATCGTGA CAACGATGAT CGGCCTGCAC
CATTTCCTGT CGCCATTCCA GGTGGTCTGG GGGCGCATCG CCGACAATCA TCCGGTCTTC
GGGCTGCGCC GCACGCCGTA CCTGCTCCTC GGCGCAACGG TTGCCAGCCT GGTGTTCCTG
GCATTGCCAT CGGTTGCACT GGCGATGGGT GAAGGGTCGG TGCTGGCGAT TGTCGCCGGG
TATGCGCTTC TCATTATCTT TGGCATCAGT ATTGCAGCCA TGGGCGACTG CCACCATGCG
CTGATTGCCG AGGTGACCAA CCCAAAGACG CGCGGCGGCG TGATTGCGGT GGTCTGGACA
TTCACCATCA TGAGCACGAT TATCGCTGCA ACGGTCATCA AGGCGCGCAT GCCAGCGTAT
ACGCCTGAGT CGATGCAGGC GCTCTACAAT CTGACGCCGC TGGTGGTGAT CGGTGCGACG
CTGCTCGGCG TGCTCGGTAT CGAAAGGCGT CTGAGCCGCC AGGAACTGGC GGTCGCCCTC
GAACGCGCAC GCGCTGCTGC GCCTCCGGGC AACCCGCTGA GCGCAGCGTT CGGTCTGCTG
CGCCAGAACC CGCAGGTGCG CGCGTTCTTC GGGTTTGTGT TCCTTTCGAT CATCGGTATC
TTCCTGCAAG ACTCGATCCT GGAGGTCTTT GGCGCTGAGG TCTTCCACAT GACCTTGAAG
GAGACGACGA CATTTACGCA AACGTGGGGC GGCGGTGTGC TCGGCGGTAT GCTGATCATG
GGTCTGCTCA GCGCCATCTT CCCGATCGGC AAGAAACTGA TCGCGCTGAT CGGCGGCGTC
GGCACCGCCT TTGGTCTCGG ATTGCTGGCG GTCTGTGCGC TTACCGAACA GCGCGCGTTG
CTCAATCCGG CAATTATGCT GATGGGCGTC AGCACCGGGC TGTACAACGT CGGTGCGCTG
TCGCTGATGA TGGATATGAC CGTCGAAGGC GCCACGGGTC TGTACATGGG CATGTGGGGC
ATGGCGCAGG CATTCGGCAC GGCAACGGCG AATATTCTGG CAGGTGCGCT CCACACGGTG
CTGATCGAGG CGCAGGCGCT TAGCCAGACG CTGGGGTATG GCGTGATCTT CGGTCTGGAA
GCAGTGCTGA TGATTGTCGG CATTGCGCTG CTGTCCGGCG TCAGCGTCGA AGCCTTCCGT
GGTTTGACGC GCGCCGACAT TACGCGCGCC ATGGAAGCGG GCGCGGTCGC TTGA
 
Protein sequence
MNLLRTIGRF LLGALKVLRL ALPKLGVGWM FALLTSNFNR VSIVELGVMA VIVTTMIGLH 
HFLSPFQVVW GRIADNHPVF GLRRTPYLLL GATVASLVFL ALPSVALAMG EGSVLAIVAG
YALLIIFGIS IAAMGDCHHA LIAEVTNPKT RGGVIAVVWT FTIMSTIIAA TVIKARMPAY
TPESMQALYN LTPLVVIGAT LLGVLGIERR LSRQELAVAL ERARAAAPPG NPLSAAFGLL
RQNPQVRAFF GFVFLSIIGI FLQDSILEVF GAEVFHMTLK ETTTFTQTWG GGVLGGMLIM
GLLSAIFPIG KKLIALIGGV GTAFGLGLLA VCALTEQRAL LNPAIMLMGV STGLYNVGAL
SLMMDMTVEG ATGLYMGMWG MAQAFGTATA NILAGALHTV LIEAQALSQT LGYGVIFGLE
AVLMIVGIAL LSGVSVEAFR GLTRADITRA MEAGAVA
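
The relationship between the gene sequence and the protein sequence can be sketched in Python. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted, so a standard codon table is enough for this purpose. The helper names `clean`, `gc_content` and `translate` are illustrative and not part of any library, and only the first line of the gene sequence is used as input to keep the example short.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumes the sequence contains only A, C, G and T; any other
    character raises a KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAATCTGC TGCGCACAAT CGGTCGGTTC TTGCTGGGCG CGCTGAAGGT GTTGCGTCTG"
print(translate(first_line))
print(round(gc_content(first_line), 2))
```

Translating these first 60 bases gives the first 20 residues of the protein sequence listed above. Passing the complete gene sequence to the same functions should reproduce the full protein and a GC fraction close to the reported 62%, although that has not been run here.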