Gene Rcas_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1022 
Symbol 
ID5538488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1331690 
End bp1332994 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content64% 
IMG OID640893161 
ProductPUCC protein 
Protein accessionYP_001431144 
Protein GI156741015 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGGA CAACCATCTT CCGGCTTGGA CTCATCAACG TCGCCGTTGC ACTGACGGCC 
GTTCCCATCG AAAGCACGCT CAACCGGGTC ATGATCAGCG AACTGCGGTT GCCTGCCTCG
CTCGTGGCGC TGCTGATCGC GTTGCCCTAC GCTTTTTCGC CGATCCAGAT CTGGATTGGG
TCATTCTCAG ATCGCCATCC CTTGCTGGGG CTTCGCCGCA CACCCTACAC GGTCATCGGG
CTGCTGCTGT GCGCTGGCGG GTCGGCGCTC TCGCCGACGG CGGCATTTGC CATGGCGGAG
AACTTCTGGG CAGGGTTGCC GCTGGGGCTG CTGGCATTCG GCGCGTGGGG CATGGGGTTC
AACTTTGCGA CGGTATCGTA CCTGTCGTTG GCGACGGATC TCTCTGGCGA AGAGCACCGC
GCCCGCACCG TCGGCGTCAT GTGGTTTATG CTGATTGTGA GTGTCATCAT TGCCGGAGTA
AGCCTTTCGC GGATGCTGCG CGACTACACG CCGGCGGCGC TGTTCACGGC GTTCTACGGA
GTGTGCGCTG TTGCGTTGCT GATTGGCATC GCCGCACTCT GGGGGCTTGA AGCGCGCGAC
GACGAAGCGG CGCCCGCTGA GCGGCGCAGT TTTGCGCAGA TGATCGCCAC GGTCGCCGGA
ACGCCGCAGG CACGCCTATT TTTTATCTAT CTGGTACTAC TGCTGATTGC CATCCTGGGG
CAGGACGTGC TGCTCGAACC ATTCGCCGCC GACGTGTTCG GCGTACCGGT TGAAGTCACA
ACGCGCTACA CCTCGATCTG GGGCGCAGCG CTACTGGTTG GACTGCTGGC GACGAGTCCG
CTGGCGCGAC GGCGCGGAAA ACCATTCGCC GCCGCAATCG GCGGAACGCT GGTTGCGATC
GGGTTAGTGT TGATCGCGCT GACCGGCATC CTGGGCATGC CGGAGATCTT CACGCCGAGT
CTGATCCTCT TCGGATTTGG CAGCGGCGTT TCGACCGCCG CGAACCTGGC GCTCATGCTC
GATATGACCA TTCCGGGGCA GATCGGCGCT TTTGTCGGCG CGTGGGGCGT CGCCAATTCG
ATGGCGCGTC TGCTGGGAAC CGTGTTGAGC GGCGTCACCC GTGACATTTT GACCGACCTG
TTCAATGATC GCATGCCCGG ATACGTTATC GTCTTCCTGC TTCAGGCTGC TGCGATGGCG
GCGTCGCTGG CGCTGCTGCC GCGGATCAGC ACCGCGCGCT TCCGCCACGA AATGGCGCCG
TCGGCGCGTG AACTCGCGGC GCTTGCCGGC GAGGCGCAGG GGTAA
 
Protein sequence
MRWTTIFRLG LINVAVALTA VPIESTLNRV MISELRLPAS LVALLIALPY AFSPIQIWIG 
SFSDRHPLLG LRRTPYTVIG LLLCAGGSAL SPTAAFAMAE NFWAGLPLGL LAFGAWGMGF
NFATVSYLSL ATDLSGEEHR ARTVGVMWFM LIVSVIIAGV SLSRMLRDYT PAALFTAFYG
VCAVALLIGI AALWGLEARD DEAAPAERRS FAQMIATVAG TPQARLFFIY LVLLLIAILG
QDVLLEPFAA DVFGVPVEVT TRYTSIWGAA LLVGLLATSP LARRRGKPFA AAIGGTLVAI
GLVLIALTGI LGMPEIFTPS LILFGFGSGV STAANLALML DMTIPGQIGA FVGAWGVANS
MARLLGTVLS GVTRDILTDL FNDRMPGYVI VFLLQAAAMA ASLALLPRIS TARFRHEMAP
SARELAALAG EAQG