Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4478 |
Symbol | |
ID | 4070961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5313597 |
End bp | 5314820 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986517 |
Product | major facilitator transporter |
Protein accession | YP_593552 |
Protein GI | 94971504 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.414241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATC CAATCTTTGT GTCTACTCAG GAACAGAAAC TACAGGCCAC CACGCTGCTG ATGGCAATCG GCGTCGGCGT GGCCGTCGCC AATATTTATT ATTGCCAGCC GCTGCTCGGC ATCATGGCGC AGGACCTGCA CGTGAGTGAG CGCCATACCG GCTGGATCGC GACCCTTACG CAAGTCGGCA CAGCTATCGG CATGCTGTTG TTCGTTCCTC TCGGCGACAT TGCGGAGCGT CGCAAGCTCA CAGTGCGCAT GTGCGTGTTC GTGGCGTTCG CCGCACTGCT CACAGCGCTC GCGCCGAGCT TTTCCCTGTT GGCACTGTTC AGCTTTCTGC TGGGGCTCGG GTCCGTGATA CCGCATCTAA TCCTTCCATT CGCCGCGCAC CTGGCGCCGG AAGGCGCACG CGGTAAAGTC GTCGCAAAGG TAATCAGTGG AATGCTCGTC GGCATCCTGC TGGCACGGAC TGTTGCTGGA TTCGTCGGCG CGGCATTCGG ATGGCGCGCT ATTTTTTTCA TCTCCGCCGG GCTGATGCTC GGGCTCGCGA TCGCTTTTCG CGAGTTGCTG CCAGAGAGCC ACCCATCGGT ACAGATGCAC TACTTCGATC TTCTGCGCTC AGTAGGCAGC ATGGTGCGGG AACATCGCGG GCTGCGCGAG TCGGCGGCGA TCGGGGCTTT GCTATTCGCT TCGTTCAGCG CGTTCTGGAC CACGCTGGTG TTCTTTCTCG CCAAGCCGCC GTATCACTAT GGCGCGCGCA TGGCCGGTGG ACTTGGCTTG CTGGCTGCTG CCAGTGCGGC ACTCGCGCCC ATCGTGGGCC GCATGGTTGA CCGTCGTTCA CCGAAGCTTG GCATCTCGAT CGCGGTCATC ACAACACTTG CGTCTTATGC CGTGATGATC GCAACGGGGC ACTGGCTGAT CGGCCTGGGG CTCGGCGTCA TTCTGCTCGA CGTCGGCGTG CAGACGGGAC ACATCAGCAA CCAGACGCGC ATTTACAACA CGTTCCCGCA CGCACGCAGC CGAGCTAATA CCGTCTACAT GGTGAGCTAT TTTGTCGGCG GCGCGTTCGG GTCCGCGCTG GGCAATGCCG GTTGGCATTT CTTCGGATGG GCGGGCGTTT GCGCCGCTGG CGCGGTCGTT CTGCTTCCAG CGCTTCTGAT CGTACGCACC ATGCGCGAAG AAAAAATGGA ACGAGTGGAA GAGACAGAAT TATCAGTGGC GTAA
|
Protein sequence | MANPIFVSTQ EQKLQATTLL MAIGVGVAVA NIYYCQPLLG IMAQDLHVSE RHTGWIATLT QVGTAIGMLL FVPLGDIAER RKLTVRMCVF VAFAALLTAL APSFSLLALF SFLLGLGSVI PHLILPFAAH LAPEGARGKV VAKVISGMLV GILLARTVAG FVGAAFGWRA IFFISAGLML GLAIAFRELL PESHPSVQMH YFDLLRSVGS MVREHRGLRE SAAIGALLFA SFSAFWTTLV FFLAKPPYHY GARMAGGLGL LAAASAALAP IVGRMVDRRS PKLGISIAVI TTLASYAVMI ATGHWLIGLG LGVILLDVGV QTGHISNQTR IYNTFPHARS RANTVYMVSY FVGGAFGSAL GNAGWHFFGW AGVCAAGAVV LLPALLIVRT MREEKMERVE ETELSVA
|
| |