Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0868 |
Symbol | |
ID | 5898323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 924629 |
End bp | 925957 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561351 |
Product | major facilitator transporter |
Protein accession | YP_001682497 |
Protein GI | 167644834 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.201883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGA CCGGCGCTGA CGTCAGCCTC AGCGAACGCG ACAAGGGCCG GGCCTTCGCC ATCCTGTTCG CCGTGCTGCT GTCCAGCGCC GCGGGCAACA CCGCCCTTCA AACCGTCCTG CCCGCCATTG GCCGCCAGGT CGGCATTCCC GACGTGCTGA TCAGCTCGAT CTTCTCGCTG TCGGCCCTGC TGTGGGGCGT GATGTCGCCG GTCTGGGCGC GGATGTCCGA CAAGCACGGC CGCAAGCCGA TGGTGGTGCT GGGCATGGCC GGCTTCGCGG TCTCTATGCT GGGCTTTGGC TTCTTCATCT TCATGGGGCT CAAGGGCCTG ATGGTCCCGC TGGCGGTGTT CGCCGGAGCG ACCCTCTCGC GGGCGATCTT CGGCCTGGTC GGCTCGGCCT CGAACCCGGC GGCCCAGGCC TATGTCGCCG ACCGCACCGC CCCCGTCGAC CGCACCAACG CCCTGTCGAC CATGGCCTCG GCCAGCGGGC TGGGCACGAT CCTCGGCCCG GCCGTGGCGC CGTTCCTGGT CTTTCCGCTG CTGACCCTGT CGGGGCCGAT GTTCGCGTTC GCGGCCATCG CCGTGGTGGT GCTGGTGCTG GTGATCCGCG GCCTGCCCGA ACGGCCCGAC GAGATCCCCG ACCGCGAGGG CGACAGGGCC AGGCAGCCAC GAGCGAGGGT GCGCTGGAAC GACCGGCGGA TCATGCCCTT CATCCTCTAC GGCTTCCTGC TGGCCAGCGC CCAGACCGTG AACCAGCAGA CCCTGGGCTT CATGGTCATC GACAAGCTGA ACATCTCGCC GGCCAAGGCC GCCGCCTTCG CGGGCGTGGC GATGATGGCC GGCGCCGTGG CCAGCCTGCT GGCCCAATGG GGCCTGATCC GCATGCTGCG CCTGACCCCG CGCATGCTGC TGTGGCTGGG CGCGGGCTGC GCGGCCGTGG GCAACCTGAT CGTCGCCTTC TCGCCGGACT ACCACACCCT GGTCGTCGGC TTCGCCCTGT GCAGCCTCGG CTATGGCTTC GCCCGCCCGG GCTTCACGGC TGGCGCCTCG CTGTCGGTGG GCCACGAGGA GCAGGGGGCC GTGGCCGGCG CGATCAGCGC CATCAACGGC GCCTCGGTGA TCATCGCCCC GGTGCTGGGC GTGGCGCTCT ACAAGTGGGC CCATCCCTCG CCCTACCTGA TGAACGTCGC GATCCTGGCG GGCCTGGCCA TCTACGCCCT GCTAAATCCC GTCATGCGCC GCGTGGGCGA CGCCGAGCAG GCCCGGGAAC GCCGCGACGA AAGCCAGGTC GTCGACGCCA GCTCGATCGA CGCGACGGGA CCGCACTAG
|
Protein sequence | MTGTGADVSL SERDKGRAFA ILFAVLLSSA AGNTALQTVL PAIGRQVGIP DVLISSIFSL SALLWGVMSP VWARMSDKHG RKPMVVLGMA GFAVSMLGFG FFIFMGLKGL MVPLAVFAGA TLSRAIFGLV GSASNPAAQA YVADRTAPVD RTNALSTMAS ASGLGTILGP AVAPFLVFPL LTLSGPMFAF AAIAVVVLVL VIRGLPERPD EIPDREGDRA RQPRARVRWN DRRIMPFILY GFLLASAQTV NQQTLGFMVI DKLNISPAKA AAFAGVAMMA GAVASLLAQW GLIRMLRLTP RMLLWLGAGC AAVGNLIVAF SPDYHTLVVG FALCSLGYGF ARPGFTAGAS LSVGHEEQGA VAGAISAING ASVIIAPVLG VALYKWAHPS PYLMNVAILA GLAIYALLNP VMRRVGDAEQ ARERRDESQV VDASSIDATG PH
|
| |