Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4108 |
Symbol | |
ID | 5902567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4464748 |
End bp | 4466052 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564629 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001685730 |
Protein GI | 167648067 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0814495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.311636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGC TTCGGCCGTT GAAGACGCTG TATGTCCAGG TGCTGATCGG CATCGCCCTG GGGGTGCTAG TCGGGGCGCT GTGGCCGGAG GTCGGGGTGG CGCTGAAGCC GCTGGGCGAT GCGTTCATCA AGCTGGTCAA GCTGGTGATC GCCCCGGTGA TCTTCCTGAC CGTGGCCAGC GGCATCGCCC ACATGGGCGA CATCAAGGCG TTCGGCCGGG TGGGGGTCAA GGCCCTGCTC TATTTCGAGG TGGTCTCGAC CCTGGCCCTG GCGGTCGGGC TGGTCGTCGG CCACATCCTG CAGCCGGGCC ACGGCTTCAA CATCGACCCG GCCACGCTGG ATCCGAAGAT CGCCGCCGGC TACCTGGAGA AGGCCCATCA CGGCGAGGGG CTGGTTCCCT ATCTGCTGCA CCTGATCCCC GACACCTTCT TCGGAGCCTT CGCCGAGGGC AACCTGCTGC AGGTGCTGGT GATCTCGGTG CTGACGGGTT TCGCCTGCAC GCGCATGGGT CCGTTCGGCG ACCGTGCGGC GGCGGCGATG AGCGACATGG CCAAGCTGTT CTTCGGCATC ATCCACGTGG TGGTCAAGTT GGCGCCGCTG GGGGCGTTCG GGGCCATGGG CTTCACGATC GGCAAGTACG GGCTCGCCAG CCTGGTGCAG TTAGGGGCGC TGGTGGCCAC CTTCTACATC ACCGCCCTCC TGTTCGTGCT GGTGGTGCTG GGGGCGATCG CCTGGGCCTG CGGCTTCTCG ATCCTCAAGT TCCTGGCCTA TATCCGCGAG GAGTTGCTGA TCGTGCTGGG CACCAGTTCG TCGGAGAGCG CCCTGCCCCA GCTGATCGAG AAGCTGGAGC GGCTGGGCGC CCGCAAGTCG GTGGTCGGGC TGGTGGTGCC CACCGGCTAC AGCTTCAACC TCGACGGCAC CAACATCTAC ATGACCCTGG CCACCCTGTT CCTGGCCCAG GCCACCAACA CCCACCTGAG CTGGATCCAG ATGGCGAGCC TGCTGGGCGT CGCCATGCTG ACCTCCAAGG GGGCCAGCGG GGTGACCGGG GCCGGCTTCA TCACCCTGGC CGCCACCCTG GCCGTGGTCC CCGACATCCC GATCGCCGCC CTGGCGGTGC TGGTCGGGGT CGACCGGTTC ATGAGCGAGT GCCGGGCCCT GACCAATTTC GTCGGCAACG GCGTGGCCAC CCTGGTGGTG GCCCGCTGGG AGGGCGCCCT GGATCGCGAC CGGTTGGCGC GGGAACTGAC CCGAGGTCCC AACGTCCCGC CCGTCGAGGT CGTCGAGGAA CTTCCCGCCG CCTGA
|
Protein sequence | MGLLRPLKTL YVQVLIGIAL GVLVGALWPE VGVALKPLGD AFIKLVKLVI APVIFLTVAS GIAHMGDIKA FGRVGVKALL YFEVVSTLAL AVGLVVGHIL QPGHGFNIDP ATLDPKIAAG YLEKAHHGEG LVPYLLHLIP DTFFGAFAEG NLLQVLVISV LTGFACTRMG PFGDRAAAAM SDMAKLFFGI IHVVVKLAPL GAFGAMGFTI GKYGLASLVQ LGALVATFYI TALLFVLVVL GAIAWACGFS ILKFLAYIRE ELLIVLGTSS SESALPQLIE KLERLGARKS VVGLVVPTGY SFNLDGTNIY MTLATLFLAQ ATNTHLSWIQ MASLLGVAML TSKGASGVTG AGFITLAATL AVVPDIPIAA LAVLVGVDRF MSECRALTNF VGNGVATLVV ARWEGALDRD RLARELTRGP NVPPVEVVEE LPAA
|
| |