Gene Information |
Locus tag | Pmen_2665 |
Symbol | |
ID | 5107974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas mendocina ymp |
Kingdom | Bacteria |
Replicon accession | NC_009439 |
Strand | + |
Start bp | 2922726 |
End bp | 2924354 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640503909 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_001188152 |
Protein GI | 146307687 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
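The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2922726
END_BP = 2924354


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1629 542
```

Both results match the reported Gene Length (1629 bp) and Protein Length (542 aa).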
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0627895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCG ATAAACCTTC GCCTTGGGGC GCACTCAGGC ACAGCACCTT CCGCTGGCTG TGGCTGGCCA GCATCGCCTC GAACATCGGC ACCTGGATGC ACGAGGTGGG TGCCGGCTGG CTGATGACCA GCCTGTCGGC CAACCCCATG CACGTGGCGC TGGTACAGGT CGCCGGCTCG GCGCCGATGT TCCTCCTGGC GCTGCCGGCC GGCGCCATGG CCGATATCGT CGACAAGCGC CGCTATCTGC TCGGCGTGCA GCTGTGGATG GCGGCGGTGG CGACGCTGCT GGCCGTACTG ACCCTGCTCG GCCTGACCAC GGTCTGGCTG CTGCTGGGCA TGACCCTGGC CATGGGCGTC GGCACCGCGT TGATGATGCC GGCCTGGTCG GCGCTGACGC CGGAGCTGGT GAGCAAGCGC GACCTGCCCT CGGCCATCGC CCTGTCCAGC CTCGGCATCA ACGTCGCCCG TGCCCTCGGC CCGGCCATCG CCGGGGTACT GGTCAGCTTG AGCGGCCCCT GGGCGACCTT CGCCCTCAAC GCCCTGTCGT TCTTCGCGGT GATGGCGGTG CTGCTGACCT GGAAGCGCGA GCGCCAGGTG GCGGCCTTCC CGGCCGAGCG CCTGCTGGGC GCCATGCGCG CCGGCTGGCG CTATAGCCGC GCCTCGAAAC CGCTGCAATC GGTGCTGGTA CGCGCCGCGG CCTTCTTCGT CGGCGCCAGC GCCGGCATGT CGCTGCTGCC GCTGATCGTG CGCGGCGAGT TGCAGGGCAG CGCCAGCGAC TTCGGCCTGA TGCTGGGCAG CGTCGGCGTC GGCGCGGTGC TCGGCGCCAC CTTGCTGCCG CGCATTCGCG AGCGCATCAG CAGCGACCGT CTGGTGCTGC TGGCCAGCCT GCTCTACGCC CTGGTGCTGC TGGCCCTGGC CAGCCTGCGC CAGTTCGCCG CCCTGCTGCC GGTGATGCTG CTCAGCGGCG CGGCGTGGAT CGCCGTGCTG TCCAGCCTGC AGGTCGCCGC GCAGACCTCG GTGCCGGACT GGGTAAGGGC GCGCGCCCTT TCCATCTATA TCCTGGTGTT CTTCGGCAGC ATGGCCGCCG GCGGCGCGCT GTGGGGCTTC GTCGCCAGCC AGGCGTCGAT CCCCCTCGCC CTGTTCGCCG CGGCCGGCTG TCTGGCGCTG GGCGGGCTGC TGACGTCGCG CTTTCCGCTG CCGGTTACCG AAGCCGAGGA TCTCGCCCCG TCGCTGCACT GGCCGGCGCC GATCCTGGCC GACGAAGCCG ACCTCGAGCG CGGTCCGGTG ATGGTCACGC TGCACTACGA CATCGCGCCG GAGCATGCCT CGGCCTTCCG CCAGGCAATG AGCGAGGTGG CGCGCATGCG CAGACGCAAC GGTGCGTTCT CCTGGGGTCT GGTGCAGAGC AGCGAGAACC CGCGCCACTG GCAGGAGTTC TTCTTCGACG AGTCCTGGCT CGAACACCTG CGCCACCACG GCCGGGTGAC CCGCGCCGAG CAACGCATCG AGGCGGCAGC CAGGCAGTTC CAGAGCGCCG GGGTCGCGAT ACGCATCGAC CACTTCCTGA TGCCGGGCAA GCAGGCGCCA GAGCATTCGG ACATTCAGCA TGCCCACGGT CAGCCATGA
Protein sequence | MASDKPSPWG ALRHSTFRWL WLASIASNIG TWMHEVGAGW LMTSLSANPM HVALVQVAGS APMFLLALPA GAMADIVDKR RYLLGVQLWM AAVATLLAVL TLLGLTTVWL LLGMTLAMGV GTALMMPAWS ALTPELVSKR DLPSAIALSS LGINVARALG PAIAGVLVSL SGPWATFALN ALSFFAVMAV LLTWKRERQV AAFPAERLLG AMRAGWRYSR ASKPLQSVLV RAAAFFVGAS AGMSLLPLIV RGELQGSASD FGLMLGSVGV GAVLGATLLP RIRERISSDR LVLLASLLYA LVLLALASLR QFAALLPVML LSGAAWIAVL SSLQVAAQTS VPDWVRARAL SIYILVFFGS MAAGGALWGF VASQASIPLA LFAAAGCLAL GGLLTSRFPL PVTEAEDLAP SLHWPAPILA DEADLERGPV MVTLHYDIAP EHASAFRQAM SEVARMRRRN GAFSWGLVQS SENPRHWQEF FFDESWLEHL RHHGRVTRAE QRIEAAARQF QSAGVAIRID HFLMPGKQAP EHSDIQHAHG QP
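The relationship between the two sequences can be illustrated by translating the first three 10-base blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline: the codon table below is deliberately partial (only the codons present in the excerpt), and the names `PARTIAL_CODON_TABLE` and `translate` are hypothetical. For these codons, translation table 11 assigns the same amino acids as the standard code.

```python
# Partial codon table: only the codons that occur in the excerpt below.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GCC": "A", "AGC": "S", "GAT": "D", "AAA": "K",
    "CCT": "P", "TCG": "S", "TGG": "W", "GGC": "G",
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring the spaces used as block separators."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(PARTIAL_CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as formatted in the record.
first_blocks = "ATGGCCAGCG ATAAACCTTC GCCTTGGGGC"
peptide = translate(first_blocks)
print(peptide)  # MASDKPSPWG
```

The result matches the first block of the protein sequence. Translating the full gene would need a complete codon table, for example the table 11 definition shipped with a library such as Biopython.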