Gene Information |
Locus tag | Emin_0907 |
Symbol | |
ID | 6262609 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1004856 |
End bp | 1006148 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642611388 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_001875799 |
Protein GI | 187251317 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
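
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS is unspliced and ends in a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1004856
end_bp = 1006148

# Assumption: coordinates are 1-based and inclusive, so the span length
# is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1293 430
```

Both values agree with the Gene Length and Protein Length rows.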
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000000931505 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATAAAAA GTAATATGTT TAAGTTGTTA AAAGCAGTAT TTGTAATATG CCCGTTTGTT TTTGCCAGCA TGGCGGATTC AGCGGGTTTT GCTCTTTATG AGTTTAGCGC AAGAGGCAAT GCCATGGGCG GCGCGGTTAT GGCCAACAAG GCGGAGCCCG CTTCAATAGC CACTAATCCT GCGCTTATCA CCCAGCTTGA AGGCACACAG CTTCAAACAG GCATAACGGC AGTTATTGTA TCCGGTTCCA CAACCATAGG AACCGAAAAA AGAGATTTGG AAACAGGCGC ATTCTACCTT CCCGCTTTCT TTCTTACTCA ACAGTTAAGG GAAGATGTTT TTTTCGGTTT AGGTTTTTTT CCGAGATATG GCCTTGGCGG CAGATATAAA GATTATGAAA CATGGTCAAT GGGCACCGTT TTAAGGGAAG CTTATACGGT AGACTTAATG ACTTATTCCT TTAACCCCAA TTTAGCGGTT AAAGTAACGG ATGACTTGTC TTTCGCTATG GGTCTTGAGG TTATGTTTCT TGATTTTCAG GAAAGGAAAC ACCTTGTCGG CGGCGCAAAA ATGGATATTT CCGGTAATTC CACGACATGG GGCGGAAATT TTGGCGTTTT GTATAACCCC GGCTGGGCCG AAAAATGGGC TGCGGCTTTA ACTTACAGAA CAAAAACAAG GCACGTAGCC ACCGGCAGGG TAAAAACCCC AGGCTTGGCG CTTCCTCCCG CTATATCTTT TGACGGTAAC GCCTCCGCCG CACTTACGCT GCCTGACCAG TTGGCCTTCG GTTTGTCTTT TACTCCTACA AATAAACTTA CCATGGAAGC CAACATTATG GGCATTTTTT GGAGTTCTTA CAGCCAATTA AAAATTGATT ATGACGACCT TAGCCAGTCG CCGGGCGGGC CGGGTAATCC TCCGTATTTA AACGAAAACA AAAATTATAA AGATGTTTTC CGCATAGGTT TTGGCGTTGA ATACTCTTTA AATCCCACAT GGGATTTAAG GGCGGGTTAT GTTTTCGATA AGTCCCCTAT TAATAAAAAG TATATGGATA CTCTTGTTCC TGCAGATGAC AGAAATATCT TCAGCGTCGG CGCGGGGTAT AACGTATCTG AAAGAATGGG TATAGACGTT TCTTATTCAT ATGTTTTAAT AAGCGACCTT TCCGGCAGAA ACGTGGAAAA AGGAAATACA GTGTTTAAGT ATGAAGACGC TTCAAGCCAC ATGATAGGCT TATCCTTTAA ATACGCTTTT GGAAACGGCC CCTTAAGTAA CAGAGTTATA TAA
|
Protein sequence | MIKSNMFKLL KAVFVICPFV FASMADSAGF ALYEFSARGN AMGGAVMANK AEPASIATNP ALITQLEGTQ LQTGITAVIV SGSTTIGTEK RDLETGAFYL PAFFLTQQLR EDVFFGLGFF PRYGLGGRYK DYETWSMGTV LREAYTVDLM TYSFNPNLAV KVTDDLSFAM GLEVMFLDFQ ERKHLVGGAK MDISGNSTTW GGNFGVLYNP GWAEKWAAAL TYRTKTRHVA TGRVKTPGLA LPPAISFDGN ASAALTLPDQ LAFGLSFTPT NKLTMEANIM GIFWSSYSQL KIDYDDLSQS PGGPGNPPYL NENKNYKDVF RIGFGVEYSL NPTWDLRAGY VFDKSPINKK YMDTLVPADD RNIFSVGAGY NVSERMGIDV SYSYVLISDL SGRNVEKGNT VFKYEDASSH MIGLSFKYAF GNGPLSNRVI
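
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used by the database. It assumes the gene sequence is already given in the coding orientation (it starts with ATG and ends with TAA, although the gene lies on the minus strand). It also relies on translation table 11 sharing its amino acid assignments with the standard code and differing only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments and
# differs from the standard code only in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGATAAAAA GTAATATGTT TAAGTTGTTA"))  # MIKSNMFKLL
```

The output matches the first block of the protein sequence. Ambiguous bases such as N are not handled and would raise a `KeyError`.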