Gene Information |
Locus tag | Phep_2207 |
Symbol | |
ID | 8253313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2541370 |
End bp | 2542560 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644935856 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003092473 |
Protein GI | 255532101 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.78926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCAT CTTCCCATTC AATTTATACC CTTCAGTTCG GTCTCGTTTG TTTAAGCTCT TTCCTTTTTT CTGCCAGCTT TAATATGCTC ATCCCCGAAC TTCCTGCTTA TTTAACCGCA ATGGGGGGCG CAGCGTATAA AGGGCTCATT ATTGCACTGT TTACACTCAC TGCCGGAATA TCCAGACCTT TCAGTGGTAA ACTCACGGAT ACCATTGGCC GTGTACCGGT AATGGCAGTG GGCTCACTGG TTTGTTTTTT ATGCGGTTTT CTTTATCCGC TCCTCACTAC CATTGCCGGA TTTCTGTTCC TCAGGTTAAT ACATGGTTTC TCTACAGGCT TTAAGCCTAC CGCTACGGCC GCCTATGTGG CCGATCTGGT TCCTCCAGGA AAATGGGGCG AGGCGATGGG GGTACATGGT GTATGTTTTA GCACCGGCCT GGCCATTGGC CCCGCAATTG GCAGTACCAT CACCGATCAT TACAGCATCA ATGTGCTGTT TTACTGTTCT TCTTTATTTG CCCTGCTTTC CATTGTTATC CTGGCCAACA TGAAAGAAAC CCTGCCAGGC AAACAAAAAT TCCGCGCAGC ACATTTAAAG ATCAATAAAA AAGACATCAT TGAATGGCGG GTGATCCCGG CCGTGGTGAT CATCTTTTTA AGTTACATCA GCTATGGCTC CATACTCACG GTCATATCCG ATTGGAGTGC ACACCTGGGC ACCAGTAACA AAGGTCTGTT CTTTATGGTT TTTACGCTCA CTTCTTTATT GATCCGCTTT GTGGCCGGTA AGGCATCCGA CAGGTATGGC CGTACACTCA TTTTAAGAAT ATCCCTCGGC CTGCTGGCTG TATCCCTGAT GCTGATCGCC ATAGCCAGTT CTTCTTTTAC CCTGATGATG GCATCTGCTT TATATGGGGT GGCTACAGGT ATGCTCTCGC CAACAGCAAC GGCCTGGACG GTAGACCTGA GCGAACCCAC ACAAAGGGGT AAGGCCATGG CCACCATGTA CATTGCCCTC GAAGCTGGTA TTGGTTTGGG TGCACTCCTT GCCGGCTGGT TGTTTATAGA CAATATCCGC ATGATCCCTG TAACTTTTTA CTGCTGTACA GGCATTACAC TGATTGCCCT GGTTTACCTT CAGTTTTTTT ACCGGACAAA GCAGTACATT TCTCCTAAAA ATGGCAGTTA G
|
Protein sequence | MQPSSHSIYT LQFGLVCLSS FLFSASFNML IPELPAYLTA MGGAAYKGLI IALFTLTAGI SRPFSGKLTD TIGRVPVMAV GSLVCFLCGF LYPLLTTIAG FLFLRLIHGF STGFKPTATA AYVADLVPPG KWGEAMGVHG VCFSTGLAIG PAIGSTITDH YSINVLFYCS SLFALLSIVI LANMKETLPG KQKFRAAHLK INKKDIIEWR VIPAVVIIFL SYISYGSILT VISDWSAHLG TSNKGLFFMV FTLTSLLIRF VAGKASDRYG RTLILRISLG LLAVSLMLIA IASSSFTLMM ASALYGVATG MLSPTATAWT VDLSEPTQRG KAMATMYIAL EAGIGLGALL AGWLFIDNIR MIPVTFYCCT GITLIALVYL QFFYRTKQYI SPKNGS
|
| |
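The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not translated (the gene sequence above ends in TAG). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2541370
END_BP = 2542560
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both checks should agree with the Gene Length and Protein Length fields.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence value to `gc_percent` gives a figure to compare with the GC content field. The function strips the spaces between the 10-base blocks before counting.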