Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00960 |
Symbol | uup |
ID | 8116205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1014463 |
End bp | 1016370 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644847222 |
Product | hypothetical protein |
Protein accession | YP_002998795 |
Protein GI | 251784491 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000554098 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTAA TCAGTATGCA TGGCGCATGG CTGTCGTTCA GCGACGCGCC GCTTCTCGAT AACGCAGAAC TGCATATCGA AGATAACGAA CGTGTTTGTC TGGTGGGCCG CAACGGCGCA GGCAAATCGA CGTTAATGAA AATCCTCAAC CGTGAACAAG GGCTGGATGA CGGTCGCATT ATTTACGAGC AAGATTTGAT TGTAGCGCGT CTGCAACAGG ATCCGCCGCG TAACGTTGAG GGTAGCGTTT ATGATTTCGT TGCCGAAGGC ATTGAAGAAC AAGCGGAATA TCTGAAACGC TATCACGATA TTTCGCGCCT GGTGATGAAC GACCCGAGCG AGAAAAATCT CAACGAACTG GCGAAGGTTC AGGAACAGCT GGATCACCAC AACCTGTGGC AGCTGGAAAA CCGCATCAAC GAAGTGCTGG CGCAACTGGG GTTAGATCCT AACGTTGCGC TGTCGTCGCT TTCCGGCGGC TGGTTGCGTA AAGCGGCATT AGGACGCGCG CTGGTGAGTA ATCCGCGCGT GCTGTTGCTT GATGAACCGA CAAACCACCT GGATATTGAA ACCATCGACT GGCTGGAAGG GTTTTTGAAA ACTTTCAACG GGACGATTAT TTTCATCTCC CACGACCGTT CGTTTATCCG CAATATGGCG ACGCGCATTG TTGATCTCGA TCGCGGCAAG CTGGTGACCT ATCCAGGGAA TTACGACCAG TACCTGCTGG AAAAAGAAGA AGCCCTGCGC GTGGAAGAAT TACAAAATGC CGAGTTCGAT CGCAAACTGG CGCAGGAAGA GGTGTGGATC CGCCAGGGGA TCAAAGCACG CCGTACCCGT AATGAAGGCC GCGTACGCGC CCTGAAAGCG ATGCGTCGCG AACGTGGTGA ACGTCGCGAA GTGATGGGTA CCGCAAAGAT GCAGGTGGAA GAGGCCAGCC GCTCCGGTAA AATCGTTTTC GAAATGGAAG ACGTTTGCTA CCAGGTTAAC GGTAAGCAAC TGGTGAAAGA TTTTTCTGCC CAGGTTCTAC GTGGCGACAA AATTGCCCTG ATTGGTCCGA ATGGGTGCGG CAAAACCACG CTGCTAAAAC TGATGCTCGG TCAGCTTCAA GCGGACAGCG GGCGTATTCA CGTTGGCACC AAACTGGAAG TGGCTTATTT CGATCAGCAC CGCGCGGAAC TGGATCCCGA TAAAACGGTG ATGGATAACC TTGCCGAAGG TAAGCAAGAG GTGATGGTTA ACGGCAAGCC ACGCCACGTA TTGGGCTATT TGCAGGACTT TCTGTTCCAT CCGAAACGGG CGATGACGCC GGTACGTGCG CTTTCTGGCG GTGAGCGGAA CCGCTTGCTG CTGGCGCGTT TGTTCCTCAA ACCAAGCAAC TTATTGATTC TTGACGAACC GACCAACGAT CTTGATGTCG AAACGCTGGA ACTGCTGGAA GAACTGATCG ACAGCTATCA GGGCACGGTA TTGCTGGTTA GCCACGATCG TCAGTTTGTC GATAACACCG TTACAGAATG TTGGATCTTC GAAGGCGGCG GTAAAATTGG TCGTTATGTC GGCGGTTATC ATGATGCCCG TGGTCAGCAA GAGCAGTATG TGGCGCTCAA ACAGCCTGCG GTGAAAAAAA CCGAAGAAGC CGCCGCGGCA AAAGCAGAAA CTGTAAAACG CAGCAGTAGC AAACTAAGCT ATAAATTGCA GCGCGAACTG GAGCAGCTAC CGCAATTGCT CGAAGATCTG GAGGCGAAGC TGGAAGCCCT ACAGACGCAA GTGGCGGATG CTTCCTTCTT CAGTCAGCCG CATGAGCAGA CGCAAAAAGT GCTTGCTGAT ATGGCTGCTG CAGAGCAGGA GCTGGAGCAA GCCTTTGAAC GCTGGGAGTA TCTTGAAGCG TTAAAAAATG GTGGCTGA
|
Protein sequence | MSLISMHGAW LSFSDAPLLD NAELHIEDNE RVCLVGRNGA GKSTLMKILN REQGLDDGRI IYEQDLIVAR LQQDPPRNVE GSVYDFVAEG IEEQAEYLKR YHDISRLVMN DPSEKNLNEL AKVQEQLDHH NLWQLENRIN EVLAQLGLDP NVALSSLSGG WLRKAALGRA LVSNPRVLLL DEPTNHLDIE TIDWLEGFLK TFNGTIIFIS HDRSFIRNMA TRIVDLDRGK LVTYPGNYDQ YLLEKEEALR VEELQNAEFD RKLAQEEVWI RQGIKARRTR NEGRVRALKA MRRERGERRE VMGTAKMQVE EASRSGKIVF EMEDVCYQVN GKQLVKDFSA QVLRGDKIAL IGPNGCGKTT LLKLMLGQLQ ADSGRIHVGT KLEVAYFDQH RAELDPDKTV MDNLAEGKQE VMVNGKPRHV LGYLQDFLFH PKRAMTPVRA LSGGERNRLL LARLFLKPSN LLILDEPTND LDVETLELLE ELIDSYQGTV LLVSHDRQFV DNTVTECWIF EGGGKIGRYV GGYHDARGQQ EQYVALKQPA VKKTEEAAAA KAETVKRSSS KLSYKLQREL EQLPQLLEDL EAKLEALQTQ VADASFFSQP HEQTQKVLAD MAAAEQELEQ AFERWEYLEA LKNGG
|
| |