Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1984 |
Symbol | |
ID | 6068177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2192542 |
End bp | 2194554 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641601398 |
Product | fusaric acid resistance protein region |
Protein accession | YP_001724957 |
Protein GI | 170020003 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0152424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000622676 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA TGGCGTTATG CGTTACGCAA TACCATTGCC ATGTGTCTGG CGCTGACGGT TGCCTATTAT TTAAATCTGG ATGAACCCTA TTGGGCGATG ACCTCGGCTG CAGTGGTTAG CTTTCCCACC GTTGGCGGTG TTATCAGCAA AAGCCTCGGA CGCATCGCTG GCAGTTTGCT CGGAGCCATT GCGGCACTGC TTCTTGCCGG GCATACGCTC AATGAGCCGT GGTTTTTTCT ATTGAGCATG TCGGCGTGGC TTGGCTTTTG TACCTGGGCC TGTGCGCACT TCACGAATAA CGTCGCGTAT GCATTTCAAC TGGCGGGCTA CACGGCTGCC ATCATCGCCT TTCCGATGGT TAATATTACT GAGGCCAGCC AGCTGTGGGA TATCGCTCAG GCGCGCGTTT GCGAGGTGAT TGTCGGCATT TTGTGCGGCG GCATGATGAT GATGATCCTG CCTAGCAGTT CCGATGCTAC AGCCCTTTTA ACCGCATTGA AAAACATGCA CGCCCGACTA CTTGAACATG CCAGTTTACT CTGGCAGCCT GAAACAACCG ATGCCATTCG TGCAGCACAT GAAGGGGTGA TTGGGCAGAT ACTGACCATG AATTTGCTGC GTATCCAGGC TTTCTGGAGC CACTATCGTT TTCGCCAGCA AAACGCGCGC CTTAATGCGC TGCTCCACCA GCAATTACGT ATGACCAGTG TCATCTCCAG CCTGCGACGT ATGTTGCTCA ACTGGCCCTC ACCGCCAGGT GCCACACGAG AAATTCTCGA ACAGTTGCTG ACGGCGCTCG CCAGTTCGCA AACAGATGTT TACACCGTCG CACGTATTAT CGCCTCGCTA CGCCCGACCA ACGTCGCCGA CTATCGGCAC GTCGCCTTCT GGCAGCGACT ACGTTATTTT TGCCGCCTTT ATCTGCAAAG TAGTCAGGAA TTACATCGTC TGCAAAGCGG TGTAGATGAT CATACCAGAC TCCCACGGAC ATCCGGCCTG GCTCGTCATA CCGATAACGC CGAAGCTATG TGGAGCGGGC TGCGTACATT TTGTACGTTG ATGATGATTG GCGCATGGAG TATTGCTTCG CAATGGGATG CCGGTGCCAA TGCATTAACG CTGGCAGCAA TTAGCTGCGT ACTCTACTCC GCCGTCGCAG CACCGTTTAA GTCGTTGTCA CTTCTGATGC GCACGCTGGT GTTACTTTCG CTATTCAGCT TTGTGGTCAA ATTTGGTCTG ATGGTCCAGA TTAGCGATCT GTGGCAATTT TTACTGTTTC TCTTTCCACT GCTGGCGACA ATGCAGCTTC TTAAATTGCA GATGCCAAAA TTTGCCGCAT TGTGGGGGCA ACTGATTGTT TTTATGGGTT CTTTTATCGC TGTCACTAAT CCCCCGGTGT ATGATTTTGC TGATTTTCTT AACGATAATC TGGCAAAAAT CGTTGGCGTC GCGTTGGCGT GGTTAGCGTT CGCCATTCTG CGTCCAGGAT CGGATGCTCG TAAAAGCCGC CGCCATATTC GCGCGCTGCG CCGGGATTTT GTCGATCAGC TAAGCCGCCA TCCAACACTG AGTGAAAGCG AATTTGAATC GCTCACTTAT CATCACGTCA GTCAGTTGAG TAACAGCCAG GATGCGCTGG CTCGCCGTTG GTTATTACGC TGGGGTGTAG TGCTGCTGAA CTGTTCTCAT GTTGTCTGGC AATTGCGCGA CTGGGAATCG CGTTCCGATC CGTTATCGCG AGTACGGGAT AACTGTATTT CACTGTTGCG GGGAGTGATG AGTGAGCGTG GCGTTCAGCA AAAATCACTG GCGGCCACAC TTGAAGAATT ACAGCGGATT TGCGACAGCC TTGCCCGTCA TCATCAACCT GCCGCCCGTG AGCTGGCGGC AATTGTCTGG CGGCTGTACT GCTCGCTTTC GCAACTTGAG CAAGCACCAC CGCAAGGTAC GCTGGCCTCT TAA
|
Protein sequence | MNASSWSLRN LPWFRATLAQ WRYALRNTIA MCLALTVAYY LNLDEPYWAM TSAAVVSFPT VGGVISKSLG RIAGSLLGAI AALLLAGHTL NEPWFFLLSM SAWLGFCTWA CAHFTNNVAY AFQLAGYTAA IIAFPMVNIT EASQLWDIAQ ARVCEVIVGI LCGGMMMMIL PSSSDATALL TALKNMHARL LEHASLLWQP ETTDAIRAAH EGVIGQILTM NLLRIQAFWS HYRFRQQNAR LNALLHQQLR MTSVISSLRR MLLNWPSPPG ATREILEQLL TALASSQTDV YTVARIIASL RPTNVADYRH VAFWQRLRYF CRLYLQSSQE LHRLQSGVDD HTRLPRTSGL ARHTDNAEAM WSGLRTFCTL MMIGAWSIAS QWDAGANALT LAAISCVLYS AVAAPFKSLS LLMRTLVLLS LFSFVVKFGL MVQISDLWQF LLFLFPLLAT MQLLKLQMPK FAALWGQLIV FMGSFIAVTN PPVYDFADFL NDNLAKIVGV ALAWLAFAIL RPGSDARKSR RHIRALRRDF VDQLSRHPTL SESEFESLTY HHVSQLSNSQ DALARRWLLR WGVVLLNCSH VVWQLRDWES RSDPLSRVRD NCISLLRGVM SERGVQQKSL AATLEELQRI CDSLARHHQP AARELAAIVW RLYCSLSQLE QAPPQGTLAS
|
| |