Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2501 |
Symbol | |
ID | 5899956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2713886 |
End bp | 2714860 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562992 |
Product | bile acid:sodium symporter |
Protein accession | YP_001684126 |
Protein GI | 167646463 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCAAAG ACCTCCTCGC CAAGCTGAAG ATCGATTCCT ACATCCTCCT GCTGATCGGG ATGGTGATCC TCGCCTCGGT CCTGCCGGTG CGCGGCGAGG CCGCGACGAT CCTCGGCTGG GTGGTCAAGA TCGCCATCGC CCTGCTGTTC TTCCTGCACG GCGCCAAGCT GTCGCGCGAG GCCGTGGTCG CGGGCCTGAC CCACTGGCGG CTGCACCTGA CCATCCTGGC CTTCACCTTC GTGCTGTTCC CGGCGCTGGG CCTCTTGATC AGCAAGTCCG GCCTGCTGTC GCCCACCCTG TCGACCGGCA TGCTGTTCCT GTGCTGCCTG CCCTCCACCG TGCAGTCGTC GATCGCCTTC ACCTCGATCG GACGCGGCAA CGTCGCCGCC GCCGTCTGCG CGGCCTCGGC CTCGAACCTG CTGGGCATCT TCCTGACCCC CGTGCTGGTC GGCCTGCTGA TGCACGCCCA CGGCGACGTC GGCGGCTGGG ACTCGATCCA GTCGATCATC GTCCAGTTGC TCGTGCCCTT CGTCGCCGGT CAGTTGGTCA GGCCGTGGGT CGGCGCCTGG ATCGAGCGCC ACAAGACCTT GGTCGGCCGC GTTGATCGGG GTTCGATCCT GCTGGTGGTC TATTCGGCCT TCAGCGCCGC CGTGGTCGGC GGGATCTGGA AGATCGTCTC GATCCCCGAG CTCGGCGTCC TGCTGGTCGC CTGCTGCGTG CTGCTGGCCG TTGTCGTAGC GGCGACCATG TTCGGGGCCC GCGCCCTGGG CTTCTCCAAG CCCGACGAGG TGGCCATCGT GTTCTGCGGC TCCAAGAAGA GCCTGGCCAC CGGCGTGCCC ATGGCCGGCA TCCTGTTCCC GGGCGCCACG GCTGGCATCC TGGTTCTGCC GCTGATGCTG TTCCACCAGA TCCAGTTGAT GGCCTGTTCG GTCCTGGCCC AACGCTACGG CGCGCGGCCG GCGGATGAGG CCTGA
|
Protein sequence | MFKDLLAKLK IDSYILLLIG MVILASVLPV RGEAATILGW VVKIAIALLF FLHGAKLSRE AVVAGLTHWR LHLTILAFTF VLFPALGLLI SKSGLLSPTL STGMLFLCCL PSTVQSSIAF TSIGRGNVAA AVCAASASNL LGIFLTPVLV GLLMHAHGDV GGWDSIQSII VQLLVPFVAG QLVRPWVGAW IERHKTLVGR VDRGSILLVV YSAFSAAVVG GIWKIVSIPE LGVLLVACCV LLAVVVAATM FGARALGFSK PDEVAIVFCG SKKSLATGVP MAGILFPGAT AGILVLPLML FHQIQLMACS VLAQRYGARP ADEA
|
| |