Gene Rcas_0371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0371 
Symbol 
ID5537833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp462889 
End bp463896 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content59% 
IMG OID640892534 
Productbile acid:sodium symporter 
Protein accessionYP_001430521 
Protein GI156740392 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID[TIGR00841] bile acid transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0981019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.637314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGA ACATCATCAC CGGAGTGTTT CTGCCAATCG CAATCGCCAT CATCATGTGG 
GGCATGGGTC TGTCGCTGGT TGTGGATGAT TTTCGGCGGG TGTTGTTCTA CCCAAAAGCG
GTCGCCATCG GTCTATTCGG TCAGTTGGTT GTCTTGCCGC TGGTTGGCTT CTTCATCGCC
TCGACGTTCA ATCTTCCGCC TGAGTATGCC GTCGGATTGA TGATTGTCGC GCTGTGCCCC
GGCGGTCCGA CGTCGAACCT GATTTCGTTC CTCTCGCGCG GTGATGTGGC GCTGTCGGTG
ACGCTGACGG CAATATCGAA TACGGTGACG GTCATCACGA TTCCGCCGCT GGTCAACTGG
ATGCTGTTTC ACTTCATGGG ACAGGGAACA ACGCTCCAGT TGCCGTTTGT GCAGACCGTC
GTGCAGATTG CGCTATTGAC AATTGTTCCA GTTGCGCTCG GGATGTGGAT GCGGGCAAAA
CGCCCTGAGT TCGCCGCTGA AGCCGATTTT CCGGTGAAGG TTGCATCGGT GGCGCTGCTG
GTTCTGGTGA TCCTGGCGGC GATCATCCGC GAGCGGGCGA TCATCGTCCA GGCATTCATT
GATGTCGGTC CGGCGACGTT GATGTTGAGC GCCGTAAGCA TGCTGCTTGG CTTTACCATC
GCTGCAATCA TGCGTCTCAA CTGGTCGCAG CGCATCACCA GCGGCATTGA GGTCGGCATC
CAGAATGGAA CCCTGGCGAT TGCGCTGGCG TCGGGCGCAA CATTCCTCAA CAACCCCGCC
ATGGCTATCC CGCCGGCGAT CTATAGCCTG GTGATGTTCG GCACGGCTGC GGCGTTCGGG
TTCTTTGTCA ATGCGCGGAT CGGGCGCCGA CAGTGCGCAT GCTGCCTTGA TCGTTTCCGC
CTTGATATCT TCGACCTGAA CCGGACGAAG GGCGAGCGCG ACAGCGAGAT TCCGGCAACT
GCCGCAACCG TGCCATCCGG GCGCGTCCAA ACCTCGACCG GTTTCTGA
 
Protein sequence
MEQNIITGVF LPIAIAIIMW GMGLSLVVDD FRRVLFYPKA VAIGLFGQLV VLPLVGFFIA 
STFNLPPEYA VGLMIVALCP GGPTSNLISF LSRGDVALSV TLTAISNTVT VITIPPLVNW
MLFHFMGQGT TLQLPFVQTV VQIALLTIVP VALGMWMRAK RPEFAAEADF PVKVASVALL
VLVILAAIIR ERAIIVQAFI DVGPATLMLS AVSMLLGFTI AAIMRLNWSQ RITSGIEVGI
QNGTLAIALA SGATFLNNPA MAIPPAIYSL VMFGTAAAFG FFVNARIGRR QCACCLDRFR
LDIFDLNRTK GERDSEIPAT AATVPSGRVQ TSTGF