Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0371 |
Symbol | |
ID | 5537833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 462889 |
End bp | 463896 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640892534 |
Product | bile acid:sodium symporter |
Protein accession | YP_001430521 |
Protein GI | 156740392 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | [TIGR00841] bile acid transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0981019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.637314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGA ACATCATCAC CGGAGTGTTT CTGCCAATCG CAATCGCCAT CATCATGTGG GGCATGGGTC TGTCGCTGGT TGTGGATGAT TTTCGGCGGG TGTTGTTCTA CCCAAAAGCG GTCGCCATCG GTCTATTCGG TCAGTTGGTT GTCTTGCCGC TGGTTGGCTT CTTCATCGCC TCGACGTTCA ATCTTCCGCC TGAGTATGCC GTCGGATTGA TGATTGTCGC GCTGTGCCCC GGCGGTCCGA CGTCGAACCT GATTTCGTTC CTCTCGCGCG GTGATGTGGC GCTGTCGGTG ACGCTGACGG CAATATCGAA TACGGTGACG GTCATCACGA TTCCGCCGCT GGTCAACTGG ATGCTGTTTC ACTTCATGGG ACAGGGAACA ACGCTCCAGT TGCCGTTTGT GCAGACCGTC GTGCAGATTG CGCTATTGAC AATTGTTCCA GTTGCGCTCG GGATGTGGAT GCGGGCAAAA CGCCCTGAGT TCGCCGCTGA AGCCGATTTT CCGGTGAAGG TTGCATCGGT GGCGCTGCTG GTTCTGGTGA TCCTGGCGGC GATCATCCGC GAGCGGGCGA TCATCGTCCA GGCATTCATT GATGTCGGTC CGGCGACGTT GATGTTGAGC GCCGTAAGCA TGCTGCTTGG CTTTACCATC GCTGCAATCA TGCGTCTCAA CTGGTCGCAG CGCATCACCA GCGGCATTGA GGTCGGCATC CAGAATGGAA CCCTGGCGAT TGCGCTGGCG TCGGGCGCAA CATTCCTCAA CAACCCCGCC ATGGCTATCC CGCCGGCGAT CTATAGCCTG GTGATGTTCG GCACGGCTGC GGCGTTCGGG TTCTTTGTCA ATGCGCGGAT CGGGCGCCGA CAGTGCGCAT GCTGCCTTGA TCGTTTCCGC CTTGATATCT TCGACCTGAA CCGGACGAAG GGCGAGCGCG ACAGCGAGAT TCCGGCAACT GCCGCAACCG TGCCATCCGG GCGCGTCCAA ACCTCGACCG GTTTCTGA
|
Protein sequence | MEQNIITGVF LPIAIAIIMW GMGLSLVVDD FRRVLFYPKA VAIGLFGQLV VLPLVGFFIA STFNLPPEYA VGLMIVALCP GGPTSNLISF LSRGDVALSV TLTAISNTVT VITIPPLVNW MLFHFMGQGT TLQLPFVQTV VQIALLTIVP VALGMWMRAK RPEFAAEADF PVKVASVALL VLVILAAIIR ERAIIVQAFI DVGPATLMLS AVSMLLGFTI AAIMRLNWSQ RITSGIEVGI QNGTLAIALA SGATFLNNPA MAIPPAIYSL VMFGTAAAFG FFVNARIGRR QCACCLDRFR LDIFDLNRTK GERDSEIPAT AATVPSGRVQ TSTGF
|
| |