Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02499 |
Symbol | proX |
ID | 8113464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2646199 |
End bp | 2647191 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848699 |
Product | hypothetical protein |
Protein accession | YP_003000272 |
Protein GI | 251785968 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT GCTGCCGATC TGCCGGGCAA AGGCATTACT GTTAATCCAG TTCAGAGCAC CATCACTGAA GAAACCTTCC AGACGCTGCT GGTCAGTCGT GCGCTGGAGA AATTAGGTTA TACCGTCAAC AAACCCAGCG AAGTAGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC GCCAAACTGT TCGATACCAA CGGCGACGGA AAAGCGGATT TAACCGGTTG TAACCCTGGC TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CCATACCGTG ACGCATAATC AGGGGAACTA CGCGGCGATG ATGGCCGACA CCATCAGTCG CTACAAAGAG GGCAAACCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAACGCC GATACCAAAC TGCCGAATGG TGCGAATTAT GGCTTCCCGG TCAGCACCAT GCATATCGTT GCCAACAAAG CCTGGGCCGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG TTGCCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA GGCGATATTC AGGGCCATGT TGATGGCTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC TGGGTGAATG AGGCGCTGGC AGCGCAGAAG TAA
|
Protein sequence | MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTHTV THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE GDIQGHVDGW IKAHQQQFDG WVNEALAAQK
|
| |