Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2380 |
Symbol | |
ID | 4071378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2814134 |
End bp | 2815558 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984396 |
Product | major facilitator transporter |
Protein accession | YP_591455 |
Protein GI | 94969407 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0389897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.03889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGGAGC CGCGAGGCAC CAGCGCGCAT CCAAACGGTA AGATGGGAGA GATGGCGAGC CATACCCCGG CGGGAGAAGG AATTCAAAAA TCGAAATTTC TGAATCGGCT GCCGGCGCTG AAGGTCCGAA ACTTTCAGCT CTTTTTTGCG GGACAGCTGA TTTCGCTGAT CGGCACGTGG ATGGACAACG TGGCCGAGGC GTGGTTGATC TACCGCCTGA CCGGTTCGTC GTTGAAGCTT GGGACGGTCG GGTTTTGCAG CCAGATCCCG GTGTTCCTGT TCGCGCCGCT GGGCGGAATT GTCGCCGACC GATATAACCG CCACAAAATC ATCATCGCCA CGCAGGCGAC TTCGATGGTG CTGGCGGGCA TTCTTGCAAT CCTTACGCTT ACACATCGGG TGCAGGTCTG GCACGTGTTC CTGCTGGCAG CGCTGATGGG TGTGGTAAAT GCATTCGACA TTCCCGCGCG TCAGGCCTTC CTCTCCGACA TGGTCGGTCG CGAAAATCTG ATGAATGCCA TCGCGCTGAA TTCCTCGATG TTTAACGGAG CACGCATTGT TGGGCCGGCA GTGGCCGGCA TTTTGGTGGC AAGCATTGGT GAGGGCTGGT GTTTTGGCGC GAATTCGCTC AGCTATATCG CGGTCATCAC CGGGCTGCTC ATGATGAAGC TGAATCTTCC AGTCCGCATT GCGAGCGGGA AGTCGCCGCT GCAGGACATC GTCGAGGGGT TCCAGTTCGT CAAGGAAGCC GCGCCCATCC GGACGCTGCT CCTACTGCTT GGATTGGTGA GTTTAGTCGG CATGCCTTAC TCCGTGCTGA TGCCGATTTT CGCGGACCAC ATTCTGCATG GCGGCGCGAG AGGGCTGGGC ATCCTGATGG GCGCAACCGG CGTTGGTGCG CTCGGCGGAG CGTTAACGCT CGCATTGAAG AATGGTCTTA AGGGGATTAG CCGGATTATC AGCTACTGTG CATTCGGCTT CGGCACGAGT TTGATCCTGT TTTCGTTCTC GCGCTGGTTC TGGCTCTCCG CGGCGCTCCT GATCCCGGTG GGCTACTCGA TGATGGTGCA GATGGCGAGC TCAAACACGC TGCTGCAATC CATGACGCCA GATCGGCTGC GCGGGCGGGT GCTCGCTGTG TATTCGATGA TGTTCATGGG TATGGCGCCG TTCGGAGCGT TATTCGCGGG AGCGATCGCT GAGCGCATCG GTGCGCCTTG GACAGTGGCA GTCGGTGGAG TCGCCTGTAT TTGCGGCGGC TTGTTCTTCC GGAGGAACCT GGCTACCTTC CGCGATGGGG CTCGCAAGAT GGTCCTCGCA CAACAGATGG TCGGTGGCGA ACCGGCACCG GAAGTCACGG CGGGATCGCT GGTGCCAGCG ACCGATGCGG AACTCGGCGA GGAGCCGATC AGTTCTACGT CTTAA
|
Protein sequence | MLEPRGTSAH PNGKMGEMAS HTPAGEGIQK SKFLNRLPAL KVRNFQLFFA GQLISLIGTW MDNVAEAWLI YRLTGSSLKL GTVGFCSQIP VFLFAPLGGI VADRYNRHKI IIATQATSMV LAGILAILTL THRVQVWHVF LLAALMGVVN AFDIPARQAF LSDMVGRENL MNAIALNSSM FNGARIVGPA VAGILVASIG EGWCFGANSL SYIAVITGLL MMKLNLPVRI ASGKSPLQDI VEGFQFVKEA APIRTLLLLL GLVSLVGMPY SVLMPIFADH ILHGGARGLG ILMGATGVGA LGGALTLALK NGLKGISRII SYCAFGFGTS LILFSFSRWF WLSAALLIPV GYSMMVQMAS SNTLLQSMTP DRLRGRVLAV YSMMFMGMAP FGALFAGAIA ERIGAPWTVA VGGVACICGG LFFRRNLATF RDGARKMVLA QQMVGGEPAP EVTAGSLVPA TDAELGEEPI SSTS
|
| |