Gene Information |
Locus tag | Caul_1131 |
Symbol | |
ID | 5898586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1198246 |
End bp | 1199481 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641561613 |
Product | putative 3-hydroxyphenylpropionic transporter MhpT |
Protein accession | YP_001682759 |
Protein GI | 167645096 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
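The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TAA). The function and variable names are illustrative and not part of the source database.

```python
# Values copied from the record above; names are illustrative only.
start_bp = 1198246
end_bp = 1199481
gene_length_bp = 1236
protein_length_aa = 411


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1236
print(expected_protein_length(gene_length_bp))  # 411
```

Under these assumptions the reported gene length (1236 bp) and protein length (411 aa) agree with the coordinates.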
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000122679 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCAAGG GCGAGGCATC CGCCGTGCGG GCTCCGGGCG GAACGGCGGC GATCATCGCC TGCTGCGCCC TGGCCCTCTT CGAGGGCTTT GATCTGCAGG CCGCGGGCGT CGCCGCCCCG CGCCTGGCTC CCGATCTTGG ACTTGGACCG GAGGCGCTCG GCTGGTTCTT CAGCATCAGC ACCTTTGGCC TCATGCTCGG CGCGGCGGTC GGCGGAAGGC TCTCCGATCG CTACGGGCGC AAGCCTGTTC TGGTGGTTTC CGTCGTTGTG TTCGGGATCC TGTCGGCCCT CACCGGCCTG GCCCAGACCC AAGAGCACCT GCTGGTCGCC CGGTTCCTGA CCGGGGTCGG CCTGGGCGGC GCCCTGCCCA ACCTCATCGC CATCGTCGCC GAAAGCGCCG GCGAACAACG GCGCAGCCGG GCGGTGGGCC TGCTCTATGC CGGCCTGCCC TGCGGCGGCG CCCTGGCCAG CCTGGTCAGC CTGGCCGGCG CCGAGCCTTC CGACTGGCGG ATGATCTTCT ATGTCGGCGG CCTTGGCCCG CTGCTGATCC TGCTCGCGAC CGCCCGCCAC CTGCCCGGCG CGACGCAGCC GCCGACCATC GCCGCCCCCG GCCTCGCGCC GCCAAAGGCG GGCTTCGTCG AGGCCGCCGT CGGCGAAGGC CGCGCCATCA CCACCTTGCT GCTGTGGACC GTCTTCCTGC TGGCCCTGCT GATCATGTAC CTGCTGCTCA GCTGGTTGCC TTCGCTGCTG ATCGGCCGGG GCCTCAGCCG CCCGGACGCC GGCCTCGTGC AGATCGCCTT CAACCTGGCG GGGGCCGCGG GAAGCGTGGC GGCCGGCTGG CTGATGGACC AGCGCGGCTG GCGGCTGGCG ACCATCGTCG GCGTGTTCGC CGCGGCGGCG GCCTCGGTTC TGGTGCTGGC GAACGCGCCG GTGTCCCTGA TGATCTCGCT GCTGGTCGGC GCGGCGCTGG GGGCCACGGT GTCGGGCGTG CAGTCGGTGG TCTATGGCCT GGCGCCGGGC TTCTACCCCA GGCGGCTGCG GGGAACCGGG GTGGGCGCGG CGGTGGTCAT GGGCCGGTTG GGATCGGCGC TGGGTCCGCT GTTGGCCGGC GCGCTGCTGG CGACCGGGCG CTCGCCCGCG CAGGTGCTGC TCACCCTCCT GCCCGTCCTC GCCCTTGGGG CCGTCCTCTG CGTATGGCTG GCCAACCGGC CGCTGGCCCT CGGGGACGAG ACCTAA
|
Protein sequence | MVKGEASAVR APGGTAAIIA CCALALFEGF DLQAAGVAAP RLAPDLGLGP EALGWFFSIS TFGLMLGAAV GGRLSDRYGR KPVLVVSVVV FGILSALTGL AQTQEHLLVA RFLTGVGLGG ALPNLIAIVA ESAGEQRRSR AVGLLYAGLP CGGALASLVS LAGAEPSDWR MIFYVGGLGP LLILLATARH LPGATQPPTI AAPGLAPPKA GFVEAAVGEG RAITTLLLWT VFLLALLIMY LLLSWLPSLL IGRGLSRPDA GLVQIAFNLA GAAGSVAAGW LMDQRGWRLA TIVGVFAAAA ASVLVLANAP VSLMISLLVG AALGATVSGV QSVVYGLAPG FYPRRLRGTG VGAAVVMGRL GSALGPLLAG ALLATGRSPA QVLLTLLPVL ALGAVLCVWL ANRPLALGDE T
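The GC content and protein fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is given in coding orientation (it begins with ATG and ends with TAA), strips the space-separated 10-base blocks, and uses the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); translation table 11
# uses the same amino-acid assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGGTCAAGG GCGAGGCATC CGCCGTGCGG"
print(translate(prefix))  # MVKGEASAVR
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison against the reported GC content and the protein sequence listed in this record.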