Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1354 |
Symbol | |
ID | 6143516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1342552 |
End bp | 1343790 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616232 |
Product | PqiA family integral membrane protein |
Protein accession | YP_001743412 |
Protein GI | 170683557 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000343828 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.147943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGGCTA TCGGCGAGGA ACTGCCGCGT GGTGATTACC AACGTTGCCC GCAATGTGAC ATGCTGTTTA GCCTGCCCGA GATAAATTCT CATCAAAGTG CCTATTGTCC GCGCTGTCAG GCAAAAATTC GTGACGGGCG CGACTGGTCG CTAACGCGCC TGGCGGCAAT GGCGTTCGCT ATGCTGTTGT TGATGCCGTT TGCCTGGGGC GAACCGCTGT TGCATATCTG GCTGTTAGGC ATCCGTATTG ACGCCAACGT TATGCAAGGC ATCTGGCAAA TGACCAAACA GGGCGATGCG ATAACGGGGT CGATGGTCTT TTTTTGCGTT ATTGGTGCCC CCCTTATTCT GGTGACCTCC ATAGCTTATT TATGGTTTGG TAACCGACTG GGAATGAATT TACGTCCGGT ACTGCTGATG CTTGAGCGAC TGAAAGAGTG GGTAATGCTC GATATCTACC TGGTCGGCAT TGGCGTTGCT TCTATAAAGG TACAGGATTA TGCCCATATC CAGGCGGGTG TCGGCTTGTT CTCTTTTGTG GCGTTGGTGA TTTTAACGAC GGTGACGTTG TCACATCTTA ATGTCGAAGA ACTGTGGGAG CGATTTTATC CGCAGTGCCC CGCTACGCGA AGGGACGAGA AACTCCGTGT CTGTCTTGGG TGTCATTTTA CCGGCTATCC TGATCAGCGT GGTCGCTGCC CGCGTTGCCA TATCCCGCTA CGCCTGCGTC GCCGTCATAG TCTGCAAAAA TGCTGGGCGG CGCTGTTAGC GTCAATCGTT TTGTTGTTAC CTGCCAACCT GTTGCCTATC TCTATCATTT ATCTGAATGG TGGTCGGCAG GAAGATACGA TTCTTTCCGG AATTATGTCG CTGGCAAGTA GCAACATTGC GGTTGCGGGA ATCGTGTTTA TCGCCAGTAT TCTGGTACCG TTTACTAAAG TGATCGTCAT GTTCACTTTA CTGTTGAGTA TTCATTTTAA ATGCCAGCAA GGTTTACGCA CACGCATTCT GTTACTGCGG ATGGTGACCT GGATTGGTCG CTGGTCGATG CTCGACCTGT TTGTCATATC TTTAACCATG TCGCTGATTA ATCGCGATCA GATCCTCGCT TTTACTATGG GACCGGCTGC GTTTTATTTC GGCGCAGCGG TAATTTTGAC TATTCTTGCT GTGGAATGGC TGGACAGCCG CTTACTTTGG GATGCACATG AGTCAGGAAA CGCCCGCTTC GACGACTGA
|
Protein sequence | MRAIGEELPR GDYQRCPQCD MLFSLPEINS HQSAYCPRCQ AKIRDGRDWS LTRLAAMAFA MLLLMPFAWG EPLLHIWLLG IRIDANVMQG IWQMTKQGDA ITGSMVFFCV IGAPLILVTS IAYLWFGNRL GMNLRPVLLM LERLKEWVML DIYLVGIGVA SIKVQDYAHI QAGVGLFSFV ALVILTTVTL SHLNVEELWE RFYPQCPATR RDEKLRVCLG CHFTGYPDQR GRCPRCHIPL RLRRRHSLQK CWAALLASIV LLLPANLLPI SIIYLNGGRQ EDTILSGIMS LASSNIAVAG IVFIASILVP FTKVIVMFTL LLSIHFKCQQ GLRTRILLLR MVTWIGRWSM LDLFVISLTM SLINRDQILA FTMGPAAFYF GAAVILTILA VEWLDSRLLW DAHESGNARF DD
|
| |