Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2369 |
Symbol | |
ID | 5899824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2571651 |
End bp | 2572994 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562860 |
Product | major facilitator transporter |
Protein accession | YP_001683994 |
Protein GI | 167646331 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAAT CATCCCATCC GGTATCGGCG TTCCGCGCCT GGCTGGCGCT GGCGATCATG ATCGCGGCGA TGTTGTATGC GCTCGTGGAT CGTCAGGTGT TCCTGCTGGT CGCCGCCGAG ATGAGCAAGA CGCTGCGTCT CAGCAACACG CAACTCGGCC TCATCCAAGG TGTGGGCTTC GCGGCCCTGA CCTTGCTCGG CGCCTATCCG ATCGCCTGGT TCGCCGACCG TTACGATCGA CGTTGGGTGC TCGCCATTTG CATTCTGTGC TGGGCCATGG GGACGGCGGC TTGCGGTCTT GCGAACGGCT TTTCGCCCTT GTTCATTGCG GCCGGGGCCG TGGCGGTCTC CGAAGCCGGC ATCGCCCCGA TCTTCATGTC GATGCTGCCC GAGCTGTTCC GCGGCCAAGC CAGGGTGACC GCAACCATGA TCTACTATGT CGCGGTCTCC CTGGGCATGG CGGCGGGCAT GTTCGTGGTC GGCGCCATGA TGGCGGCGGT CGATGCGCTC AAGCCGCTGC CCGGGTTCTT GAGCGGGCTG GAAAACTGGC GCCTGGCCTA TCTGGCGGCG GCGGCTCCAT TTCCGGTCTT GATCGCGATG ATCTTCTTCT TGCCGATCGG GCGAGTCCCG GGCGCGCGAG CGAAAGCGAC GGCCGCGCCG ATCACGCCGT TCTTGCGCGC GCACTTCAAG TCTGTGGCGC TGGTGTTCGG GGCGATGACC TTCTTCGCCC TGGGCGTGAC GAGCGTTCTG GCTTGGACGC CCGTGTCGCT GACCCGGATC TTCGGCCTGA GCCCCGCGTC TGTCGGCATG GTGCTGGGCG CGGTGATCGC CGCCGCGAGC GTCGCCGGGG TCACGGCTGG CAATTTCGTC ATGCCGCCCC TGCAGCGGCG GATCGGCTAT CGTGCCGCCC CTCGCATCGT CTGGGTGTCG CTGATCGCGT CGCTCCCGCT GGTCTGCCTC ATTCCCTTCG CGACGGCGCC CTGGCAGGTC TTCGCCTGTG TCGGCGTCCA GGTTTTCGCC TCGACGATCG CCGGGGCCTC GAGCGTCAGC CTGCTGCAGG ACTTGGCGCC CCCTGAAGTA CGGTCGCGCA TCATGGCGCT CCGCGCGATG ACCAATGGAC CGGCAATCGG CCTGGGCATA GCCGGCTCGG CGTTTCTGGG CGACGTCATC AAGGCGGGGC CGCAGAGCCT GTTCTGGGGC GGCCTGTGCA TAACCGTTCC GGCCTGGATC GCCACCATCG TGATGCTGCG GCTCGCCGAG AAGCCCTTCG AGGTTACGGC TCGTGAGAGC ACGGGCATGC GCAGCCCACT CGACTTTTCT GCGCCGACCA AAGACGTCGG TTAG
|
Protein sequence | MQQSSHPVSA FRAWLALAIM IAAMLYALVD RQVFLLVAAE MSKTLRLSNT QLGLIQGVGF AALTLLGAYP IAWFADRYDR RWVLAICILC WAMGTAACGL ANGFSPLFIA AGAVAVSEAG IAPIFMSMLP ELFRGQARVT ATMIYYVAVS LGMAAGMFVV GAMMAAVDAL KPLPGFLSGL ENWRLAYLAA AAPFPVLIAM IFFLPIGRVP GARAKATAAP ITPFLRAHFK SVALVFGAMT FFALGVTSVL AWTPVSLTRI FGLSPASVGM VLGAVIAAAS VAGVTAGNFV MPPLQRRIGY RAAPRIVWVS LIASLPLVCL IPFATAPWQV FACVGVQVFA STIAGASSVS LLQDLAPPEV RSRIMALRAM TNGPAIGLGI AGSAFLGDVI KAGPQSLFWG GLCITVPAWI ATIVMLRLAE KPFEVTARES TGMRSPLDFS APTKDVG
|
| |