Gene Information |
Locus tag | ECH74115_0721 |
Symbol | rlpA |
ID | 6969382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 748735 |
End bp | 749823 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384756 |
Product | rare lipoprotein A |
Protein accession | YP_002269269 |
Protein GI | 209399263 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0797] Lipoproteins |
TIGRFAM ID | [TIGR00413] rare lipoprotein A |
|
|
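The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 748735
end_bp = 749823
listed_gene_length_bp = 1089
listed_protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid to the protein.
codons, leftover_bases = divmod(span_bp, 3)
expected_protein_length_aa = codons - 1

print("span (bp):", span_bp)
print("leftover bases:", leftover_bases)
print("expected protein length (aa):", expected_protein_length_aa)
```

The strand field (-) does not affect this calculation, because the length depends only on the two endpoints.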
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000475341 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.474734 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAGC AGTGGCTCGG GATCTGCATC GCGGCAGGAA TGCTCGCGGC ATGTACAAGC GATGATGGTC AGCAACAGAC AGTAAGTGTA CCGCAGCCTG CGGTATGTAA CGGCCCCATA GTTGAAATTA GCGGGGCGGA CCCGCGTTTC GAACCACTGA ACGCGACGGC AAATCAGGAT TACCAGCGCG ACGGTAAAAG CTACAAAATC GTGCAGGATC CGTCTCGATT TATCCAGGCG GGACTGGCGG CAATCTATGA TGCCGAACCA GGCAGTAACC TGACGGCCTC TGGCGAAGCT TTCGATCCGA CACAGCTGAC GGCGGCCCAT CCAACGCTTC CGATCCCCAG CTACGCCAGA ATCACTAACC TGGCTAACGG GCGAATGATC GTGGTGCGCA TTAATGATCG CGGTCCTTAC GGCAACGACC GCGTTATTTC GCTTTCTCGC GCAGCAGCTG ACCGTCTTAA CACGTCAAAC AACACCAAAG TTCGTATCGA TCCGATTATT GTCGCCCAGG ATGGTTCGCT TTCTGGTCCT GGTATGGCTT GTACCACAGT CGCCAAACAG ACTTACGCCC TGCCTGCACC TCCCGATTTA AGCGGTGGCG CGGGAACAAG TTCAGTGTCT GGCCCGCAGG GTGACATTCT TCCGGTCAGT AATTCGACGC TAAAAAGCGA AGATCCGACC GGCGCGCCGG TAACCAGCAG CGGTTTCCTC GGCGCACCAA CGACCTTAGC GCCAGGTGTA CTGGAAGGCA GCGAACCGAC GCCTGCTCCA CAGCCCGTTG TTACAGCTCC GTCGACAACG CCTGCAACCT CGCCTGCAAT GGTGACACCG CAAGCCGCCT CGCAAAGCGC CAGCGGCAAC TTTATGGTGC AGGTCGGGGC CGTAAGCGAT CAGGCTCGTG CGCAACAGTA CCAACAGCAA CTGGGACAGA AGTTCGGCGT CCCCGGTCGC GTAACTCAAA ATGGCGCGGT CTGGCGGATC CAGCTTGGCC CATTCGCCAA CAAAGCCGAA GCCAGTACCT TGCAGCAACG TTTGCAAACC GAAGCCCAAT TACAGTCATT TATTACCACC GCGCAGTAG
Protein sequence | MRKQWLGICI AAGMLAACTS DDGQQQTVSV PQPAVCNGPI VEISGADPRF EPLNATANQD YQRDGKSYKI VQDPSRFIQA GLAAIYDAEP GSNLTASGEA FDPTQLTAAH PTLPIPSYAR ITNLANGRMI VVRINDRGPY GNDRVISLSR AAADRLNTSN NTKVRIDPII VAQDGSLSGP GMACTTVAKQ TYALPAPPDL SGGAGTSSVS GPQGDILPVS NSTLKSEDPT GAPVTSSGFL GAPTTLAPGV LEGSEPTPAP QPVVTAPSTT PATSPAMVTP QAASQSASGN FMVQVGAVSD QARAQQYQQQ LGQKFGVPGR VTQNGAVWRI QLGPFANKAE ASTLQQRLQT EAQLQSFITT AQ
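Both sequences are printed in space-separated blocks of 10, so the spaces must be stripped before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It uses only the first three blocks of the gene sequence as input. The helper names (clean, gc_fraction, translate) are hypothetical and not part of any database tool. The amino acid string is the NCBI assignment for translation table 11, with codons ordered over the bases T, C, A, G.

```python
# First three 10-base blocks of the gene sequence, exactly as printed above.
gene_prefix = "ATGCGTAAGC AGTGGCTCGG GATCTGCATC"

# Amino acid assignments for NCBI translation table 11.
# Codons are ordered TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


dna = clean(gene_prefix)
peptide = translate(dna)

print("bases:", len(dna))
print("GC fraction:", round(gc_fraction(dna), 3))
print("peptide:", peptide)
```

Passing the full gene sequence instead of the prefix allows the result to be compared with the listed GC content, gene length, and protein sequence. This sketch assumes an unambiguous sequence containing only A, C, G, and T.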