Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0047 |
Symbol | |
ID | 5897759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 59376 |
End bp | 61037 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641560530 |
Product | general substrate transporter |
Protein accession | YP_001681683 |
Protein GI | 167644020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.252827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.746127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCC AGGCTCAGGG CGAGGATCGC GCCTCGATGC GTACGGTGGT GGCGGCCTCG TCGGCGGGCA CCGCCTTCGA ATGGTACGAC TTCTTCATCT TCGGCAGCCT GGCCCAGGTG ATCTCCAAGA CCTTCTTCAC GGGCCTGTCG GAGACGGCCG GCTATGTCGC GGCCCTGGCC CTGTTCGGGG TGGGTTTCGC CTTCCGGCCG CTGGGCGCGC TGGTGTTCGG CAAGATCGGC GACCAGGCCG GCCGCAAGGG CGCGTTCCTG GCCACCGTGC TGCTGATGGG CGGCGCCACC TTCGCCATCG CCCTGCTGCC GACCTACGGG CAGGCCGGAA TCATTTCGCC GATCCTGCTG ATCCTGCTGC GCTGCGTGCA GGGCTTCGCC CTGGGCGGCG AGTATGGCGG GGCGGCGATC TATGTCGCCG AGCATTCGCC GCCCAACGAG CGCGGCTGGT CGACCTCCTG GGTGCAGACT TCGGCGGCGT TCGGCCTGTT CGGCGCCCTG CTGGTGATCC TGCTGACCCG CTGGCTGCTG GGCGTCCAAT TCGGCCCCGA GGCTTTCGAC GCCTGGGGCT GGCGCGTCCC GTTCGCCGTC TCGATCGGCC TGCTGGGCGT CTCGGTCTGG ATGCGCCTGA AGCTCAGCGA AAGCCCGGCC TTCGCTAAGA TGAAGGAAGA GGGCGAGGCC TCCAAGGCGC CCTATGCCGA GGCCTTCGGC CAGTGGAAAA ACCTCAAGCT GGTGCTGCTG GCCTTCTTCG CCATGATGTC GGCCCAGGGG GCGGTCTGGT ACACCAGCTT CTTCTACGTC CAGACCTTCA TGGAGAAGTT CCTCAAGGTC TCGCCCACCA CGATCAACGG CCTGATGATG GCGGCCACGG CGGTCAGCGC CGTCTTCTAC GTCGTGTTCG GCTGGCTGTC GGACAAGGTC GGCCGCAAGC CGGTGATGCT GGGCGGCATG ACCTTGGCCC TGGTCTTCTA TTTCCCTGGC TTCCACCTGC TCGAGCGCGC CGCCAACCCG GCCCTGGCCG AGGCCACGGT CCGGGCGCCG GTCACCGTGA CCGCCGACCC CAGGGACTGC GCCCTGCAGT TCGATCCGGT CGGCAAGGCC GCCTTCGTCT CGTCCTGCGA CATCGCCAAG AGCGTCCTGG CCAACGCCGG CATATCCTAC GCCAACCATG CCGGCCCCGC GGCCTCGGCC GCGGTGGTCC AGGTCGGCGA CACGCGAATC GTCTCCCAGA GCGCCAAGGG ACTGCCCCCC AAGGAGGCCA AGGCCGTGAA GACCGCCGGC GAGGCCGCGA TCAAGGCCGC CTTGGCCAAG GCCGGCTACC CCACCAAGGC CGACCCGGCG CGGATGAACT GGTGGGGCAT GTTCGGCGTG CTGTTCATCT TCGTGGTCGC CGCCACCGCC CTGTTCGGCC CCCTGGCCGC CTGCCTGGTC GAACTGTTCC CCACCCGAGT GCGCTACACT GCCCTGTCGC TGCCCTACCA CATCGGAACG GGGTGGATCG GCGGCTTCGT GCCGTTCAGC GCCTTCGCCA TCGTGGCGGC GGTGGGCGAT ATCTATGCCG GGCTTTGGTA CCCGGTGTTC TTCACCCTGA TCAGCGTGCT GACCACGCTG TTCCTGCTGC CCGAGACCAA GAACCGGTCC TTGGATCAGT GA
|
Protein sequence | MTTQAQGEDR ASMRTVVAAS SAGTAFEWYD FFIFGSLAQV ISKTFFTGLS ETAGYVAALA LFGVGFAFRP LGALVFGKIG DQAGRKGAFL ATVLLMGGAT FAIALLPTYG QAGIISPILL ILLRCVQGFA LGGEYGGAAI YVAEHSPPNE RGWSTSWVQT SAAFGLFGAL LVILLTRWLL GVQFGPEAFD AWGWRVPFAV SIGLLGVSVW MRLKLSESPA FAKMKEEGEA SKAPYAEAFG QWKNLKLVLL AFFAMMSAQG AVWYTSFFYV QTFMEKFLKV SPTTINGLMM AATAVSAVFY VVFGWLSDKV GRKPVMLGGM TLALVFYFPG FHLLERAANP ALAEATVRAP VTVTADPRDC ALQFDPVGKA AFVSSCDIAK SVLANAGISY ANHAGPAASA AVVQVGDTRI VSQSAKGLPP KEAKAVKTAG EAAIKAALAK AGYPTKADPA RMNWWGMFGV LFIFVVAATA LFGPLAACLV ELFPTRVRYT ALSLPYHIGT GWIGGFVPFS AFAIVAAVGD IYAGLWYPVF FTLISVLTTL FLLPETKNRS LDQ
|
| |