Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4768 |
Symbol | |
ID | 5902230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5152528 |
End bp | 5153772 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641565288 |
Product | major facilitator transporter |
Protein accession | YP_001686386 |
Protein GI | 167648723 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.24612 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTATCG AGAAGGCGGC GATCATGACC TCCAGTCCAG TTCCCGACGC GGTCGCGCCG GCCAAGGTCC ATGCGGCGGG TCTGGTGCTG GCGGCCCTGG CCCTGGGCGG CTTCGCGATC GGCACCACCG AGTTCGCCTC GATGAGCCTG CTGCCCTATT TCGCCGCCAG CCTGGGCGTC GACGCCCCGA CCGCCGGCCA CGCGATCAGC GCCTACGCCC TGGGGGTCGT GATCGGGGCG CCGATCATCG CCGTGGCGGC CGCGCGCCTG CCGCGCCGGC TGATCCTGGT GGCGCTGATG GCGGTGTTCG CGGTCGGCAA CCTGCTCAGC GCCCTGTCGC CGAGCTTCGG CTGGATGTTG GTCTTCCGGT TCCTCAGCGG CCTGCCGCAC GGCGCCTATT TCGGCGTCGC GGCCCTGGTC GCCGCTGGCG TCTCGCCGCC CGAGCGCCGG GCGCGGGCGG TGGCCATGGT GATGATCGGC CTGACCGTGG CGACCATCGT CGGCGTGCCG CTGGCCAATG TCGTGGGCCA ATGGATCGGC TGGCGCTGGG GCTTCGTGAT CGTCGCGGCG CTGGCCATGA TGACCGCCAC GGCGGTCTGG CTGCTGGCGC CGCGCGACGC GGCCCATCCC GACGCCTCGC CGCTGCGCGA GCTGGGCGCC CTGGGTCGGG GACGGGTGTG GCTGACCCTG GGGATCGGGG CGATCGGCTT TGGCGGCATG TTCTGCGTCT ACACCTACCT GGCCTCGACC ATGGCCGAGG TCACCCACGC CTCGCCAGCC GCCCTGCCGA TGGTGCTGGC GGTGTTCGGG GCCGGCATGA CCGTCGGCAC CCTGGTCTGC GCCTGGGCCG CCGACCGCGC CCAGATGCCG GCCATCGGCG GCGTGCTACT GTGGAGCGCC GCGGCCCTGG CTCTCTATCC GATGGCGACG GGCAGCCTGT GGACCCTGGC CCCGGTGGTG TTCCTGATCG GTTGCGGCGG CGGACTGGGC GCGGTGCTGC AGACCCGGCT GATGGACGTG GCCGGCGACG CCCAAACCCT GGCGGCGGCG CTGAACCACT CGGCCTTCAA CTTCGCCAAC GCCCTGGGAC CGTGGCTGGG CGGCCTGGCC ATCGCCGCGG GCTACGGCTG GGCCTCGACC GGCTATGTCG GCGCGGCCCT GGCCCTGGGC GGCTTCGCGA TCTGGATCGT GGCGGCGGTC GATGTGCGGC GGACGCGGCG GGCGACGCTG GCGCCGGCGG AGTAG
|
Protein sequence | MRIEKAAIMT SSPVPDAVAP AKVHAAGLVL AALALGGFAI GTTEFASMSL LPYFAASLGV DAPTAGHAIS AYALGVVIGA PIIAVAAARL PRRLILVALM AVFAVGNLLS ALSPSFGWML VFRFLSGLPH GAYFGVAALV AAGVSPPERR ARAVAMVMIG LTVATIVGVP LANVVGQWIG WRWGFVIVAA LAMMTATAVW LLAPRDAAHP DASPLRELGA LGRGRVWLTL GIGAIGFGGM FCVYTYLAST MAEVTHASPA ALPMVLAVFG AGMTVGTLVC AWAADRAQMP AIGGVLLWSA AALALYPMAT GSLWTLAPVV FLIGCGGGLG AVLQTRLMDV AGDAQTLAAA LNHSAFNFAN ALGPWLGGLA IAAGYGWAST GYVGAALALG GFAIWIVAAV DVRRTRRATL APAE
|
| |