Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4868 |
Symbol | ibeA |
ID | 6146360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4979543 |
End bp | 4980913 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641619672 |
Product | invasion protein IbeA |
Protein accession | YP_001746779 |
Protein GI | 170682712 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.720893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTTT ATCTGGAACC CGCTCGTAAT ATACCTGTAC TGGCAACCAC GGAAGTGTTA GTTGTTGGTG GTGGTCCATC GGGTATTGCC GCAGCAATGA GTGCCGCTCG TGAAGGCGCA GCTACTATGC TGATTGAACG TTTCGGTTGT TTTGGCGGAA TGATGACAAC GGCTGGCGTC GAGTCAATTG CCTGGTGGCG TCATGAAAAT ACGGTAGAGT CAGGTGGACT GGCACGCGAA ATAGAAGAAA CGGCAAAATC AATGGGGGCG TCCAGCCCTG AGCCGCAATC GAATAGTCAG GCTATTAACG CCGAGCGTTT CAAACTGGTT GCGGATGCAA TGCTTGAACA GGCAGGTGTG CGCCGCGTAC TACACATTAC CGCCGTTGAT GTTATTAAGC AGGGCAATAA TTTACTCGGC GTAATAACAG AGAGTAAATC TGGTCGTCAG GCTATTTTGG CGAATGTCAT TATTGACTGT ACTGGCGATG CCGATATTGC ATGGTTTGCC GGAGCACCAT TTATTAAGCG TGAACGCGAA GAGCTAATGT GTATGACAAC CGTTTTTAGT TGCGCAAATA TAAATAAAAA CGCGTTCATG CAAAATATTA AGAGCACGGA ACCTAAATAT GGAGACTGGG GGGCGGATGA AGAAAATAAA AACTGGTCTT ATGATGTTCA TGAATCTTGT CGCGATATGT TTAGCCCTTA TCTGGGTAAA GTCTTTGCGA AAGGAAAGTC GGCAGGAATT ATTCCAAAAG ATGTGACGTT AGGCGGTTCC TGGAGTACGG TCACCGAGTA TGGTGATGCG AATTACTTGA ACGTTGTCAG CATCCCTGCC GTCGATTGTA CGGATGTTTT TGACCTGACG CGTGCAGAAA TAGAAGGCCG CAAGCAAGCC ATGCAGGCGA TTGAAGCGTT GCGTCAATTC CAGCCAGGAT TTGAACAGGC ACAATTAAAA AATTTCGGTA TGACGGTGGG AACAAGAGAA TCAAGACATA TTATTGGGCG AGTCCAGCTT ACGGAAAATG ATATTTGTAA TGAGGGACGT CATGCGGATT CAATAGGGGT ATTCCCTGAG TTTATAGATG GAAATGGTCA TCTTAAATTA CCTCTTGAAG CGAACTATTT TCAAATCCCT TATGGCGTAA TGATTCCGCA GCAAGTTGAA AACCTGTTGG TTTGCGGACG GGCAATCGAT GCAGATAATT TCGCCTATGC GACAATCCGT AATATGGGGT GTTGTATTGT CACTGGAGAA GGTGCAGGGA CTGCCGCTGC TATTGCCATT AAAAATAACA CTACCGTTTC ACAGGTAGAT ATTCAGACGG TACAGGAACG CTTACAGCAA AATGGCGTAA AAGTCTTTTA A
|
Protein sequence | MEFYLEPARN IPVLATTEVL VVGGGPSGIA AAMSAAREGA ATMLIERFGC FGGMMTTAGV ESIAWWRHEN TVESGGLARE IEETAKSMGA SSPEPQSNSQ AINAERFKLV ADAMLEQAGV RRVLHITAVD VIKQGNNLLG VITESKSGRQ AILANVIIDC TGDADIAWFA GAPFIKRERE ELMCMTTVFS CANINKNAFM QNIKSTEPKY GDWGADEENK NWSYDVHESC RDMFSPYLGK VFAKGKSAGI IPKDVTLGGS WSTVTEYGDA NYLNVVSIPA VDCTDVFDLT RAEIEGRKQA MQAIEALRQF QPGFEQAQLK NFGMTVGTRE SRHIIGRVQL TENDICNEGR HADSIGVFPE FIDGNGHLKL PLEANYFQIP YGVMIPQQVE NLLVCGRAID ADNFAYATIR NMGCCIVTGE GAGTAAAIAI KNNTTVSQVD IQTVQERLQQ NGVKVF
|
| |