Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0525 |
Symbol | |
ID | 5897980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 573940 |
End bp | 574950 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561008 |
Product | bile acid:sodium symporter |
Protein accession | YP_001682157 |
Protein GI | 167644494 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTC GCCGCCTTCG CCTGCCCCTC GATCCCTATC TTCTGGCCCT GCTGGCCACC GTTGCCCTGG CCTTCCTCCT GCCCGCCCGG GGCGGGGCGC GGACCGTCCT GAACGGGGCG ACCTACGCCG CCGTCGCCGG TCTGTTCTTC CTGTACGGCG CCAAGCTTTC GCCCCGCGCG GTCTGGACCG GGCTGACGCA CTGGCGGCTT CAGGCTCTGG TCTTCGCCAG CACCTATGTG CTCTTTCCGC TGATCGGTTT GGCGATCGGG GTGCTGGCGC GACCGCTCCT GCCCGCCGAC ATCGTCGCCG GCCTCGTCTT CCTGTGCTTG CTGCCCTCCA CCGTGCAGTC GTCGATCGCC TTCACTTCGA TCGCTCGCGG CAACGTGGCG GCGGCCCTAT GCAGCGCCTC GTTGTCGAAC ATGGCGGGCG TGGTGGTGAC GCCGCTGCTG GTGTCGCTGA TCCTGCCAAC CAGCGGCGGT CTTAGCCTGT CGTCCTTGAG TGACATCGGT CTGCAGATCT TGTTGCCCTT CGCCCTGGGC CAGATGCTGC GCCCCTGGAT TGGCGCTGGG CTGGGGCGCC ATGCGCGCAT CACCGGCCTG ATGGATCGCG GCTCGATCCT GCTGATCGTC TATGCCGCCT TCGGCGCAGG CGTGGTGGGC GGGGTGTGGA AGAGAGTGTC CGGACACACC CTGATCCTGA TCCTGGTCTT CGACCTGCTG ATCCTGGCTG TCGTGATCGC CCTCACCACC TGGGCCAGCC GCCGGGTCCG CGCCTCGACC GAGGACGAGA TCGCCATTGT TTTCTGCGGC TCCAAGAAAA GCATGGCCAG CGGCATTCCC ATGGCCAACA TCCTGTTCGC CGGCCACGCG GTGGGACTGG TCGTGCTGCC GCTGATGATC TTCCACCAGG CGCAGTTGTT CGTCTGCGCC ACCTTGGCGC GCCGCTACGC CGCCCGCCCA CGCGTCGAGG ACGCCCTCGC CGGGTCCCGA CTAGGGGTTG GGGGCCAATG A
|
Protein sequence | MAARRLRLPL DPYLLALLAT VALAFLLPAR GGARTVLNGA TYAAVAGLFF LYGAKLSPRA VWTGLTHWRL QALVFASTYV LFPLIGLAIG VLARPLLPAD IVAGLVFLCL LPSTVQSSIA FTSIARGNVA AALCSASLSN MAGVVVTPLL VSLILPTSGG LSLSSLSDIG LQILLPFALG QMLRPWIGAG LGRHARITGL MDRGSILLIV YAAFGAGVVG GVWKRVSGHT LILILVFDLL ILAVVIALTT WASRRVRAST EDEIAIVFCG SKKSMASGIP MANILFAGHA VGLVVLPLMI FHQAQLFVCA TLARRYAARP RVEDALAGSR LGVGGQ
|
| |