Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4014 |
Symbol | |
ID | 5901476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4347030 |
End bp | 4348346 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564535 |
Product | major facilitator transporter |
Protein accession | YP_001685637 |
Protein GI | 167647974 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.242843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC CCTCCCCCCT TCTTCAAGCC GGACTGGACG CCCCCGCACG CCCCGCGGCG TCGGGCCGCG GCGCCTGGCT GGTCCTGGCC ATGCTGTGGT TCGTCTATGT CCTGAACTTC CTCGACCGAC AGCTGATGTC GATCCTGGCC AAGCCGATCC AGGATGCGCT GCATGTCACC GACGGCCAGC TGGGCCTGAT CGGCGGCCTC TATTTCGCGA TGTTCTACTG CTTCATCGCC ATTCCGGTGG GATGGCTGGC CGACCGCACC AACCGTGTGG CGGTCCTGTC CCTGGCCTGC GGAATCTGGA GCATGGCGAC GGCCGCCTGC GGATTCTCGG CCAACTACGC GCAGTTCGCC GTCTCTCGCA TGACCGTCGG CTTTGGCGAG GCCGGCGGCG TTCCCCCCTC CTACGCCATC ATCTGCGACT ATTTTCCGCC CGGACAGCGC GGCACGGCAT TGAGCGTCTA CAACCTGGGT CCGCCGGTCG GGGCGGCGCT GGGCATCGCC TTCGGCGCGG CGATCGCCGC GGCCTTCAAC TGGCGCTACG CCTTCGTGGT GCTGGGCCTC GTCGGCGTGC TGGCGGCGAT CGCCCTGCCC CTGGTGGTGC GCGAGCCGCC CCGCGGCGGC ATGGATCCGG TCGGCGCGGC GCCGCCAATC CAGAAGGCCA GCTTCTGGTC GACCCTGACG ATGTTCTTCT CCCGGCCGCC GCTGGTGCTC GCGGCGCTCG GCAGCGGGGC GACGCAGTTC GTCACCTATG GCCTGGGCAA CTTCGCCACC CTGTTCCTGA TGCGCGAAAA GGGCATGACC CTCTCGGAGG TCGCCCTCTG GTACGCGCTG GTGGTGGGCG TCGGCATGAG CGCGGGGATC TTCGTGTCGG GCCGGGTGAT CGACCGCTTC ACCCGGCGGT CCAAGATCGC CTACGCCCTG GCGCCGGCGA TCTCCCTGAC CCTGGCCATT CCGTTCTATA TCGGCTTCGT CTGGGCGCCG ACCTGGCCCC TGGCCCTGGC GTTCCTGATC GGGCCGACCT TCCTCAACTA CTTCTACCTG TCCTCGTCGG TGGCGCTGGT GCAACAGGAG GTGCGGCCGG ACCAGCGGGT GATGGCCGGG GCGCTGCTGC TGCTGGTGAT GAACTTCATC GGCCTGGGCC TGGGCCCCAC CTATGTCGGC GCCGCCAGCG ACTTTTTCCG CGCCAGCCAT CCGCACAACT CGCTGCAGAT GGCGCTCTAC ACCCTGATCC CGTTCTACGT CCTGGCGGTG GGGCTGTTCC TATGGCTGGC CCGGATCTTC GCCCGGGCCG AGGGCGGCGA GGCCTAG
|
Protein sequence | MTAPSPLLQA GLDAPARPAA SGRGAWLVLA MLWFVYVLNF LDRQLMSILA KPIQDALHVT DGQLGLIGGL YFAMFYCFIA IPVGWLADRT NRVAVLSLAC GIWSMATAAC GFSANYAQFA VSRMTVGFGE AGGVPPSYAI ICDYFPPGQR GTALSVYNLG PPVGAALGIA FGAAIAAAFN WRYAFVVLGL VGVLAAIALP LVVREPPRGG MDPVGAAPPI QKASFWSTLT MFFSRPPLVL AALGSGATQF VTYGLGNFAT LFLMREKGMT LSEVALWYAL VVGVGMSAGI FVSGRVIDRF TRRSKIAYAL APAISLTLAI PFYIGFVWAP TWPLALAFLI GPTFLNYFYL SSSVALVQQE VRPDQRVMAG ALLLLVMNFI GLGLGPTYVG AASDFFRASH PHNSLQMALY TLIPFYVLAV GLFLWLARIF ARAEGGEA
|
| |