Gene Information |
Locus tag | EcSMS35_2170 |
Symbol | uup |
ID | 6144963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2175723 |
End bp | 2177630 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617046 |
Product | ABC transporter ATPase component |
Protein accession | YP_001744220 |
Protein GI | 170681436 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000710719 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.220785 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTAA TCAGTATGCA TGGCGCATGG CTGTCGTTCA GCGACGCGCC GCTTCTCGAT AACGCAGAAC TGCATATCGA AGATAACGAA CGTGTTTGTC TGGTAGGTCG TAACGGCGCA GGCAAATCGA CGTTAATGAA GATCCTCAAC CGTGAACAGG GGCTGGATGA CGGTCGCATT ATTTATGAGC AAGATTTGAT TGTAGCTCGT CTGCAACAGG ATCCGCCGCG TAACGTTGAG GGTAGCGTTT ATGATTTCGT TGCCGAAGGC ATTGAAGAAC AAGCGGAATA TCTGAAACGC TATCACGATC TTTCGCGCCT GGTGATGAAC GACCCGAGCG AGAAAAATCT CAACGAACTG GCGAAGGTTC AGGAACAGCT GGATCACCAC AACCTGTGGC AGCTGGAAAA CCGCATCAAC GAAGTGCTGG CGCAACTGGG ATTAGATCCT AATGTTGCGC TGTCGTCGCT TTCCGGCGGC TGGTTGCGTA AAGCGGCATT AGGACGCGCG CTGGTGAGCA ATCCGCGTGT ATTGCTGCTC GACGAACCGA CTAACCACCT GGATATTGAA ACCATCGACT GGCTGGAAGG GTTCCTCAAA ACCTTCAATG GGACAATTAT TTTCATTTCC CACGACCGTT CGTTTATCCG CAATATGGCG ACGCGCATTG TTGATCTCGA TCGCGGCAAG CTGGTGACCT ATCCAGGGAA TTACGACCAG TACCTGCTGG AAAAAGAAGA AGCCCTGCGC GTGGAAGAAT TACAAAATGC CGAGTTCGAT CGCAAACTGG CGCAGGAAGA GGTGTGGATC CGCCAGGGGA TCAAAGCGCG CCGTACCCGT AATGAAGGCC GCGTACGCGC CCTGAAAGCA ATGCGCCGTG AACGTAGCCA GCGCCGTGAG GTCATGGGGA CGGCAAAGAT GCAGGTGGAA GAAGCCAGCC GCTCCGGAAA GATCGTTTTC GAAATGGAAG ACGTTTGCTA CCAGGTTGAC GGTAAGCAAC TGGTGAAAGA TTTTTCTGCC CAGGTTCTAC GTGGCGACAA AATTGCCCTG ATTGGCCCGA ATGGGTGCGG CAAAACCACG CTGCTGAAAC TGATGCTCGG TCAGCTTCAG GCGGACAGCG GGCGTATTCA CGTTGGCACC AAACTGGAAG TGGCTTATTT CGATCAGCAC CGTGCGGAAC TGGATCCCGA TAAAACGGTG ATGGATAACC TTGCCGAAGG TAAGCAAGAG GTGATGGTTA ACGGCAAGCC ACGCCACGTA TTGGGCTATT TGCAGGACTT CCTGTTCCAT CCGAAACGGG CGATGACGCC GGTACGTGCG CTTTCTGGCG GTGAGCGGAA CCGCTTGCTG CTGGCGCGTT TGTTCCTCAA ACCAAGCAAC TTATTGATTC TTGACGAACC GACCAACGAT CTTGATGTCG AAACGCTGGA ACTGCTGGAA GAACTGATCG ACAGCTACCA GGGCACGGTA TTGCTGGTAA GCCACGATCG TCAGTTTGTC GATAACACCG TTACAGAATG CTGGATCTTC GAAGGCGGCG GTAAAATTGG TCGTTATGTC GGCGGTTATC ATGATGCCCG TGGTCAGCAA GAGCAGTATG TGGCGCTCAA ACAGCCTGCG GTGAAAAAAA CCGAAGAAGC CGCCGCGGCA AAAGCAGAAA CTGTAAAACG CAGCAGTAGC AAACTAAGCT ATAAATTGCA GCGCGAACTG GAGCAGCTAC CGCAATTGCT TGAAGATCTG GAGGCGAAGC TGGAAGCCCT ACAGACGCAG GTGGCGGATG CTTCCTTCTT CAGTCAGCCC 
CATGAGCAGA CACAAAAAGT GCTCGCTGAT ATGGCTGCTG CAGAGCAGGA GCTGGAGCAA GCCTTTGAAC GCTGGGAGTA TCTTGAAGCG TTAAAAAATG GTGGCTGA
|
Protein sequence | MSLISMHGAW LSFSDAPLLD NAELHIEDNE RVCLVGRNGA GKSTLMKILN REQGLDDGRI IYEQDLIVAR LQQDPPRNVE GSVYDFVAEG IEEQAEYLKR YHDLSRLVMN DPSEKNLNEL AKVQEQLDHH NLWQLENRIN EVLAQLGLDP NVALSSLSGG WLRKAALGRA LVSNPRVLLL DEPTNHLDIE TIDWLEGFLK TFNGTIIFIS HDRSFIRNMA TRIVDLDRGK LVTYPGNYDQ YLLEKEEALR VEELQNAEFD RKLAQEEVWI RQGIKARRTR NEGRVRALKA MRRERSQRRE VMGTAKMQVE EASRSGKIVF EMEDVCYQVD GKQLVKDFSA QVLRGDKIAL IGPNGCGKTT LLKLMLGQLQ ADSGRIHVGT KLEVAYFDQH RAELDPDKTV MDNLAEGKQE VMVNGKPRHV LGYLQDFLFH PKRAMTPVRA LSGGERNRLL LARLFLKPSN LLILDEPTND LDVETLELLE ELIDSYQGTV LLVSHDRQFV DNTVTECWIF EGGGKIGRYV GGYHDARGQQ EQYVALKQPA VKKTEEAAAA KAETVKRSSS KLSYKLQREL EQLPQLLEDL EAKLEALQTQ VADASFFSQP HEQTQKVLAD MAAAEQELEQ AFERWEYLEA LKNGG