Gene Information |
Locus tag | Caul_5098 |
Symbol | |
ID | 5897368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 15884 |
End bp | 17020 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641555201 |
Product | major facilitator transporter |
Protein accession | YP_001676532 |
Protein GI | 167621747 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
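The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Consistency check for the coordinate and length fields of a CDS record.
# Assumptions (not stated in the record itself): coordinates are 1-based
# and inclusive, and the last codon is a stop codon that is not translated.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length_aa(gene_length: int) -> int:
    """Residue count for a complete CDS: total codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1

length = gene_length_bp(15884, 17020)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values agree with the Gene Length and Protein Length fields listed for Caul_5098.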
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGCTGG CGGTGCTCAT CCTGACCCTC AGCACCTTCG CCTTGGGGAC CTCCGAATTC GTGATCATGG GCCTGCTGCC CGGCGTGGCG CGTGATCTGG ACGTTTCGAT CCCCGCCGCC GGCTGGATCG TCACCGCTTA TGCCTTGGGC ATCGCGTTGG GAGGGCCGAT CATGGCCCTT GCCACCGCGC GCATGCCTCG TAAGCGCGCG CTTCTGGTGT TGATGGGCCT GTTCGTTTTG GGCAACGCCC TTTGCGCCCT GTCGCCGAAC TATAGCTTGC TGCTGCTGGC GCGCGTCGTC ACAGCGATGG GTCAGGGATC ATTCTTTGGG ATCGGCGCGG TGCTGGCCGC CAGCCTGGTG CCCGACCATC GCAAAGCCAC GGCCATCGCC ACGATGTTCG CGGGCTTAAC CCTGGCCAAC GTCGCTGGCG TGCCACTGGG CACCGCGCTG GGTAATTGGG CCGGCTGGAG AGCGCCCTTC TGGGCGATCA CCGCCTTTGG CCTGCTGGCC CTGGTGGGTC TTGCAGCCAT CCTGCCGCTT CGCCGCGATG AGGAACACGC CGATTTCATC GCCGAGTTTC GGGCGCTCAG CGATGGCCGG ATCTGGGCGG CGCTCGGCAC GACGGTGTTG TTCACCGCGT GCGCCTTTCC ACTGTTCACC TACATCGTGC CAATGCTCGA GGACGTCACG GGCGTCTCCG CCGCCGGCGT GACGATCAGC CTGTTGTCGG TCGGCGTCGG GTTGACGCTG GGCAATTTCC TAGGCGGGCG CCTGGCCGAT TGGAATCTGG CGCGCGCCCT GGCGCTGATC GCCGTCTCCA TCGCCGTGGT GTCCCTGGCT CTGCGCTGGA CGAGCCCCTA CCTGATCCCG GCCCAGATCA ACTGGTTCCT GTGGGGGGCA GCGACGTTCG CAGCCATCCC CGCTTCCCAG GTCAACGTGA TGCGGTTTGG TCAGGCCGCG CCAAACCTGG TCTCGACCCT CAACATCTCG GCCTTCAACA TCGGCATCGC CACAGGCGCT TGGATCGGCG GCACGGTCCT GAGCGTCAGT CACAACCTGC TGGGCATTCC GGTGGCGGCG GCGGTCGTGG CGGTGCTGGC GTGGATCGCC GTGCTGGCCG CGGGGCGCCT ACGCTAG
Protein sequence | MPLAVLILTL STFALGTSEF VIMGLLPGVA RDLDVSIPAA GWIVTAYALG IALGGPIMAL ATARMPRKRA LLVLMGLFVL GNALCALSPN YSLLLLARVV TAMGQGSFFG IGAVLAASLV PDHRKATAIA TMFAGLTLAN VAGVPLGTAL GNWAGWRAPF WAITAFGLLA LVGLAAILPL RRDEEHADFI AEFRALSDGR IWAALGTTVL FTACAFPLFT YIVPMLEDVT GVSAAGVTIS LLSVGVGLTL GNFLGGRLAD WNLARALALI AVSIAVVSLA LRWTSPYLIP AQINWFLWGA ATFAAIPASQ VNVMRFGQAA PNLVSTLNIS AFNIGIATGA WIGGTVLSVS HNLLGIPVAA AVVAVLAWIA VLAAGRLR
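The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline that generated this record: `translate` and `gc_percent` are hypothetical helper names, and the codon table is built by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which start codons are permitted), so the standard assignments are used here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, which translation table 11 shares.
# Codons are ordered with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(dna: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(dna.split()).upper()

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First three 10-base groups of the gene sequence listed above.
prefix = "ATGCCGCTGG CGGTGCTCAT CCTGACCCTC"
print(translate(prefix))  # MPLAVLILTL
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_percent` can be used in the same way to compare against the GC content field.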