Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3774 |
Symbol | |
ID | 5901236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4090480 |
End bp | 4091766 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564297 |
Product | major facilitator transporter |
Protein accession | YP_001685399 |
Protein GI | 167647736 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.108427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGGCT GGTACCGCGA GGCGGACCCC AAGACGCGCC GCGTCTTCTG GACCTGCGGC GCCGGCTGGG CGATGGACAC CGCCGACGGG CTGGTTTTCC AGTACTTGAT CCCGGTCCTG ATGGTCGCGT TCGGCATGAC GCTGGCCCAG GCGGGCTATA TCGCCAGCGC CAACTACGTG GCCGCCGCGG TCGGGGGCTG GCTGGGCGGC TGGCTGAGCG ACCGCTTTGG CCGCGCGCGG ATCCTGCAGC TGACGATCCT GTGGTTCTCG GTCTTCTCGT TCCTGTCGGG CTTTGCTCAG ACCTACGAGC AGCTGCTCGC GGCGCGGGTG CTGCAAGGCA TCGGGTTTGG CGCGGAATGG GCGGTCGGCG CCGTGCTGCT GGGTGAGATG ATCGCGCCGA AGCATCGCGG CAAGGCGCTT GGCGTGGTGC ACAGCGGCGC GGCGATCGGG TCCGGCATCG CGGCCTTGCT GGCGGGTCCG TTCGCGGCGG CGTTCCCGAG CGACATCGGC TGGCGCGCGG TGTTCTGGAT CGGTCTCCTG CCCGCCATAC TGGTGTTCTT CGTTCGCCGG GGTTCGGACG ACCCCGAGAT CTATCGGGCC GCGGCGCGGC GCGCGGCCGA GACCGGCAAC AGGCCGAAGA TCGCCGACAT CTTCGGTCGA CGGGTGGTGC GCACCACCAT ACTGGCGTCA TTGCTCTCGC TGGGCACCCA GGGCGCGGCG TTCGCGATCA GCAACTATCT TACGTCCTTC CTGACGATCG AGCGCCACAT GACCGTCTCG ATGGCCGGAA TGTGCGTGCT GTTCAACAGC CTGGGCGGGT TCTTCGGCTT CCTGGTCAAC GCCTACATCT CCGACCATGT CGGCCGCCGG GGCGCCTTTC GTCTGTTCGG GGCCGGCTTC ATCCTGACCG CGTCGGTCTA TCTGTTCGCG CCTCTGGGCA ACTCGCCCGC CATCCTGATC CCGGCCGGCC TGATCTACGG TTTCTTCCAG TTCGGGATCT ACGCCTCGTT CGGACCCTAC TTCACCGAGC TGTTCCCGAC CGAGGTGCGC GCCACCGGAC AGGCCTTCGC CTATAATTTC GGTCGCGGGG GCGCCGCGCT GTTCATCACC GGAGTCGCCC TGCTGGCGGG GACGCTGCCG CTGAGCGCGG CGATGGCCGC CGTGGCGATC ACCGGCATGG CGCTCTCGAT CGCGGCGACC CTGGCCCTGC CGGAGACTGC GGGGCGCGCG CTGCATAGTC TTGGCGACAT AGACGCCCGT GAACTGGCCG GCGTTCCGCC CGACTGA
|
Protein sequence | MMGWYREADP KTRRVFWTCG AGWAMDTADG LVFQYLIPVL MVAFGMTLAQ AGYIASANYV AAAVGGWLGG WLSDRFGRAR ILQLTILWFS VFSFLSGFAQ TYEQLLAARV LQGIGFGAEW AVGAVLLGEM IAPKHRGKAL GVVHSGAAIG SGIAALLAGP FAAAFPSDIG WRAVFWIGLL PAILVFFVRR GSDDPEIYRA AARRAAETGN RPKIADIFGR RVVRTTILAS LLSLGTQGAA FAISNYLTSF LTIERHMTVS MAGMCVLFNS LGGFFGFLVN AYISDHVGRR GAFRLFGAGF ILTASVYLFA PLGNSPAILI PAGLIYGFFQ FGIYASFGPY FTELFPTEVR ATGQAFAYNF GRGGAALFIT GVALLAGTLP LSAAMAAVAI TGMALSIAAT LALPETAGRA LHSLGDIDAR ELAGVPPD
|
| |