Gene Information |
Locus tag | Mmar10_1642 |
Symbol | |
ID | 4284228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1800098 |
End bp | 1801036 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638141129 |
Product | bile acid:sodium symporter |
Protein accession | YP_756872 |
Protein GI | 114570192 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | [TIGR00841] bile acid transporter |
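The coordinate and length fields above are consistent with each other, which a few lines of Python can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1800098, 1801036)
print(length)                     # 939
print(protein_length_aa(length))  # 312
```

Both results match the Gene Length and Protein Length rows of the record.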
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0689619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.549741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCA GCGCGATCGA CCAATTGCAG GTAATAGTTG ATGACCAATC GCGGTTGGGG ATCGCCGCGA TTCTCTTTGT GATGATGTTC TCGGTCGCGT TGACGCTGAG GCTGGAAGAC TTCGCATTGA TACGGCGCCA ACCTGGTCGC GTGCTGGGTG GGGTCGCGGT CCAGCTGGTT GGCCTGCCCC TGCTGACCCT TGGCCTTATC CTCCTCTTGT CGCCGCCGGC CAGCATCGCC CTGGGCATGT TGATCGTGGC GAGCTGTCCG GGCGGGAATG TCTCCAACCT GCTGACCCGC GCGGCCGCCG GGAATACGGC CTACTCGGTC ACCTTGACGG CCATTTCCAG CGTCTCTTCA GCGATCATGA CCCCGCTTTC AATCCTGTTC TGGTCCGGGC TCTATGCGCC GGCCGGTGCG CTGGTCCGAT CGCTGGACGT CGACCCTTTG CCCTTTTTTG CGCAGACGGC GGTTCTGCTC GCAGTTCCCC TGATCCTGGG CATGGCGCTC AATCAGTGGC GTCCAGCCCT GGCGGGTCGC CTGGCAGCGG TCCTCGGTCC CTTGGCGCTG GCCTGTATCG CGCTGCTGGT CGTCGTCGGC ATCGTCCAGA ACTGGGCCCT GATCCTTGCC ACCGGTGCTA TCATCATCCC CATCGTCGTT CTCCATAATG GGTCTGCCTT CGCGCTTGGC TGGCTGGGCG GGCGTGCCAT GGGCATGGAA GCGGCCCGCC GCCGGGCCCT GACCTTTGAA GTCGGCATCC AGAATTCCGG GCTTGGCCTG GTCATCCTCC TGAGCCAGTT CGAAGGTGTC GGCGGTGCCG CGGCCATTAT CGGCACCTGG AGCATCTGGC ACCTGGTGGG TGGATCACTG GTTGCGGGGT TGTTCCGCTG GATGGATTCA CGGACACTCC TCGCACACGC CAGAGAGCGT GACTCATAA
|
Protein sequence | MDASAIDQLQ VIVDDQSRLG IAAILFVMMF SVALTLRLED FALIRRQPGR VLGGVAVQLV GLPLLTLGLI LLLSPPASIA LGMLIVASCP GGNVSNLLTR AAAGNTAYSV TLTAISSVSS AIMTPLSILF WSGLYAPAGA LVRSLDVDPL PFFAQTAVLL AVPLILGMAL NQWRPALAGR LAAVLGPLAL ACIALLVVVG IVQNWALILA TGAIIIPIVV LHNGSAFALG WLGGRAMGME AARRRALTFE VGIQNSGLGL VILLSQFEGV GGAAAIIGTW SIWHLVGGSL VAGLFRWMDS RTLLAHARER DS
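The protein sequence can be derived from the gene sequence using translation table 11. The sketch below is one way to do it in plain Python, assuming the gene sequence is shown 5' to 3' on the coding strand (so no reverse complement is needed even though the record lists the strand as "-"). The helper names `clean`, `translate`, and `gc_percent` are illustrative. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; this gene starts with ATG, so that difference does not come into play here.

```python
from itertools import product

# Codon order and amino-acid string as laid out in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGACGCCA GCGCGATCGA CCAATTGCAG"))  # MDASAIDQLQ
```

Passing the full gene sequence to `translate` should reproduce the protein sequence shown above, and `gc_percent` can be compared against the GC content row.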