Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1092 |
Symbol | |
ID | 5898547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1158495 |
End bp | 1160171 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561574 |
Product | major facilitator transporter |
Protein accession | YP_001682720 |
Protein GI | 167645057 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAGCG ACGCCGCGAA ACCGGTGAAA GGCGCCAAGG ATGGGCGCGC CGTGGGCAGG AAGGACGCCC TGGTCATCGG AGCCGCTTCG GTCGGCACCG TGTTCGAGTG GTACGACTTC TACCTCTACG GATCGCTGGC CACCTACATC ACCAAGCACT TCTTCTCAGG CGTCAACGAG ACCACGGGCT ACATCTTCGC CCTGCTGGCC TTCGCCGCCG GCTTCGCGGT GCGGCCGTTC GGGGCCTTGG TGTTTGGCCG GCTGGGCGAT CTGTGGGGTC GCAAGAACAC CTTCCTGGTC ACCATGCTGC TGATGGGCCT GTCGACCTTC GTGGTCGGCC TGCTGCCCAG CTACGCCCAG ATCGGCATCG CCGCGCCCAT CGCCCTGGTG GTGATGCGCC TGGTGCAGGG CCTGGCCCTG GGCGGCGAAT ACGGCGGGGC GGCCACCTAT GTGGCCGAGC ACGCTCCGGC CGGACGGCGG GGGTTCTACA CCAGCTTCAT CCAGGTGACG GCCACCTTCG GCCTGTTCCT CAGCCTGGTG GTGATCCTGC TGACCCGCCA GGCCGTCGGC GAGAGCAGCT TCGAGACCTT CGGCTGGCGC ATCCCGTTCC TGATCTCGGT GCTGCTCCTG GGCGTGTCGC TGTGGATCCG CCTGCAACTG GCCGAGAGCC CCTCGTTCCA GCGCATGGTC GACGAGGGCA AGGGCAGCAA GAAGCCGCTG GCCGACTCGT TCGCCAAGTG GGGCAATCTG AAGATCGTCA TCCTGGCCCT GGTCGGTCTG ACCGCCGGTC AGGCGGTGGT CTGGTACACC GGCCAGTTCT ACGCCCTGTT CTTCCTGGAG AAGATGCTCA AGGTCGATGG CGGCACCACC AACCTGCTGG TCGCCGCGGC CCTGCTGATC GGCACGCCGT TCTTCGTGGT CTTCGGCTGG CTGTCGGACA AGATCGGACG CAAGCCGATC ATCATGCTGG GCTGCATCCT GGCGGCCCTG ACCTATTTCC CGCTGTTCAA GACCCTGACC ACGGCGGCCA ATCCGCAGCT GGCGGCGGCC GTGGCCAGCG CCCCGGTGAC CGTGGTGGCC GATCCGGCCG ACTGCTCGTT CCAGTTCGAT CCGGTCGGCA AGACGGTGTT CAACCGTTCG TGCGACCTGG CCAAGTCCTA CCTGGCCAAG GCCGGCGTCA CCTACGCCAA CCAGGCCGCT CCGGCCGGCG CGGTCGCCCA GGTCAGGATC GGCGCGGCGA CGATCGCCTC GTTCCCCGGC CAGACCCTCG ACAAGGCCGC CTTCAAGGCT CGCAAGACGG CTTGGGAAAA GGAACTGGGC GCGGCGCTGA AATCGGCCGG CTATCCCGCC AAGGCCGATC CCGCCCGCAT CGACAAGCCG CTGGTGATCG GCGTCCTGGC GATCCTGGTG CTCTACGTGA CCATGGTCTA CGGCCCGATC GCGGCCATGC TGGTCGAGCT GTTCCCGACC AATATCCGCT ACACCTCGAT GAGCCTGCCC TATCACATCG GCAACGGCTG GTTTGGGGGC TTCCTGCCGA CCACGGCCTT CGCCATGGTC GCGGCCACGG GCAATATCTA TTACGGCCTC TGGTACCCTA TCGTGGTGGC GGCGGTGACC GCCGTGGTCG GCATCCTGTT CCTGAAGGAA ACCAAGGACG TCGACATCGA GGCGTAG
|
Protein sequence | MGSDAAKPVK GAKDGRAVGR KDALVIGAAS VGTVFEWYDF YLYGSLATYI TKHFFSGVNE TTGYIFALLA FAAGFAVRPF GALVFGRLGD LWGRKNTFLV TMLLMGLSTF VVGLLPSYAQ IGIAAPIALV VMRLVQGLAL GGEYGGAATY VAEHAPAGRR GFYTSFIQVT ATFGLFLSLV VILLTRQAVG ESSFETFGWR IPFLISVLLL GVSLWIRLQL AESPSFQRMV DEGKGSKKPL ADSFAKWGNL KIVILALVGL TAGQAVVWYT GQFYALFFLE KMLKVDGGTT NLLVAAALLI GTPFFVVFGW LSDKIGRKPI IMLGCILAAL TYFPLFKTLT TAANPQLAAA VASAPVTVVA DPADCSFQFD PVGKTVFNRS CDLAKSYLAK AGVTYANQAA PAGAVAQVRI GAATIASFPG QTLDKAAFKA RKTAWEKELG AALKSAGYPA KADPARIDKP LVIGVLAILV LYVTMVYGPI AAMLVELFPT NIRYTSMSLP YHIGNGWFGG FLPTTAFAMV AATGNIYYGL WYPIVVAAVT AVVGILFLKE TKDVDIEA
|
| |