Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1603 |
Symbol | |
ID | 8252705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1898457 |
End bp | 1899683 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644935257 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003091878 |
Protein GI | 255531506 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.501272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CAAATCCGGC CTCTGCCATT GAGCCTAAAC AAATTGCCCA GCAAACCGTA TTTCCGATAC TTTTTGCCAT TAGTTTTTCC CATTTACTAA ATGATACGAT ACAGTCGCTC ATACCTGCCA TTTATCCTAT TGTAAAGAAT ACCTATCAGC TGAGCTTTTC GCAGATTGGC TTAATTACCC TGATGTTCCA AATGGCCGCT TCTTTATTCC AGCCATTTGT AGGCTTATAT ACCGATAAAA AACCACAGCC TTATTCACTG GCTATAGGAA TGGGTTTTAC GCTGGTCGGC CTGATCACTT TATCTTTGTC CAACGGATTT TACTTCATGC TGCTTTCTGT TGCACTTATT GGTACGGGCT CTTCCATATT CCATCCGGAA GCATCTCGTA TGGCCCATGC TGCTTCAGGC GGAAGGAGAG GCCTGGCCCA GTCCATCTTT CAGCTGGGTG GCAATGCCGG AAGTTCTATC GGACCTTTGC TGGCAGCCTG GATCATTGTG CCCTACGGAC AGTTCAGTGT GATCTGGTTT TCTATCATTG CTTTACTGGC CATTATGATT TTGAGCTGGG TAGGCAAATG GTATAAGGGC TATATGGTCA ATTTAAAGGC CAGAATGGGG GCAAAAGTAA ATGTGGTAAC CAATAATTTC TCCAGAAAAA GGGTGGTATT TGCCGTGATC ATTTTACTGG TCCTTATCTT TTCAAAATAC TTTTACATGG CCAGTCTGAC CAGCTACTTT ACCTTCTATC TAATAGATAA GTTTCATGTA CCGGTGCAAA CCTCGCAGCT TTACCTGTTT GTATTTTTAT TTTCCGTTGC GGCCGGTACA CTGATCGGTG GTCCGGTGGG CGACAGGTTC GGCCGTAAAT ATGTGATCTG GTTTTCTATT TTAGGTACAG CACCTTTTGC CTTGTTGCTG CCCCATGCCA ATTTATTCTG GACCGGGGTA TTGATCGTAC CGATAGGTGT GATCCTGGCC TCAGCATTCT CTGCTATTCT GGTGTATGCG CAGGAACTGA TACCGGGTAA GGTGGGACTG GTTGCGGGAT TGTTCTTTGG TTTTGCTTTT GGTATGGGCG GTATAGGGTC TGCTTTACTG GGTAAGCTTG CCGATAGCAC CAGCATCAAT TACGTATTTC ATATCTGTGC ATTCTTGCCC CTGATTGGTA TCATTACCGG GTTTTTGCCC AATATTGAGG GCAGGAAAAA AGCCTGA
|
Protein sequence | MKTTNPASAI EPKQIAQQTV FPILFAISFS HLLNDTIQSL IPAIYPIVKN TYQLSFSQIG LITLMFQMAA SLFQPFVGLY TDKKPQPYSL AIGMGFTLVG LITLSLSNGF YFMLLSVALI GTGSSIFHPE ASRMAHAASG GRRGLAQSIF QLGGNAGSSI GPLLAAWIIV PYGQFSVIWF SIIALLAIMI LSWVGKWYKG YMVNLKARMG AKVNVVTNNF SRKRVVFAVI ILLVLIFSKY FYMASLTSYF TFYLIDKFHV PVQTSQLYLF VFLFSVAAGT LIGGPVGDRF GRKYVIWFSI LGTAPFALLL PHANLFWTGV LIVPIGVILA SAFSAILVYA QELIPGKVGL VAGLFFGFAF GMGGIGSALL GKLADSTSIN YVFHICAFLP LIGIITGFLP NIEGRKKA
|
| |