Gene EcSMS35_4868 details

Gene Information

Locus tag: EcSMS35_4868
Symbol: ibeA
ID: 6146360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4979543
End bp: 4980913
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 46%
IMG OID: 641619672
Product: invasion protein IbeA
Protein accession: YP_001746779
Protein GI: 170682712
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
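
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4979543, 4980913)
print(length)                  # 1371
print(protein_length(length))  # 456
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.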


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.720893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTTT ATCTGGAACC CGCTCGTAAT ATACCTGTAC TGGCAACCAC GGAAGTGTTA 
GTTGTTGGTG GTGGTCCATC GGGTATTGCC GCAGCAATGA GTGCCGCTCG TGAAGGCGCA
GCTACTATGC TGATTGAACG TTTCGGTTGT TTTGGCGGAA TGATGACAAC GGCTGGCGTC
GAGTCAATTG CCTGGTGGCG TCATGAAAAT ACGGTAGAGT CAGGTGGACT GGCACGCGAA
ATAGAAGAAA CGGCAAAATC AATGGGGGCG TCCAGCCCTG AGCCGCAATC GAATAGTCAG
GCTATTAACG CCGAGCGTTT CAAACTGGTT GCGGATGCAA TGCTTGAACA GGCAGGTGTG
CGCCGCGTAC TACACATTAC CGCCGTTGAT GTTATTAAGC AGGGCAATAA TTTACTCGGC
GTAATAACAG AGAGTAAATC TGGTCGTCAG GCTATTTTGG CGAATGTCAT TATTGACTGT
ACTGGCGATG CCGATATTGC ATGGTTTGCC GGAGCACCAT TTATTAAGCG TGAACGCGAA
GAGCTAATGT GTATGACAAC CGTTTTTAGT TGCGCAAATA TAAATAAAAA CGCGTTCATG
CAAAATATTA AGAGCACGGA ACCTAAATAT GGAGACTGGG GGGCGGATGA AGAAAATAAA
AACTGGTCTT ATGATGTTCA TGAATCTTGT CGCGATATGT TTAGCCCTTA TCTGGGTAAA
GTCTTTGCGA AAGGAAAGTC GGCAGGAATT ATTCCAAAAG ATGTGACGTT AGGCGGTTCC
TGGAGTACGG TCACCGAGTA TGGTGATGCG AATTACTTGA ACGTTGTCAG CATCCCTGCC
GTCGATTGTA CGGATGTTTT TGACCTGACG CGTGCAGAAA TAGAAGGCCG CAAGCAAGCC
ATGCAGGCGA TTGAAGCGTT GCGTCAATTC CAGCCAGGAT TTGAACAGGC ACAATTAAAA
AATTTCGGTA TGACGGTGGG AACAAGAGAA TCAAGACATA TTATTGGGCG AGTCCAGCTT
ACGGAAAATG ATATTTGTAA TGAGGGACGT CATGCGGATT CAATAGGGGT ATTCCCTGAG
TTTATAGATG GAAATGGTCA TCTTAAATTA CCTCTTGAAG CGAACTATTT TCAAATCCCT
TATGGCGTAA TGATTCCGCA GCAAGTTGAA AACCTGTTGG TTTGCGGACG GGCAATCGAT
GCAGATAATT TCGCCTATGC GACAATCCGT AATATGGGGT GTTGTATTGT CACTGGAGAA
GGTGCAGGGA CTGCCGCTGC TATTGCCATT AAAAATAACA CTACCGTTTC ACAGGTAGAT
ATTCAGACGG TACAGGAACG CTTACAGCAA AATGGCGTAA AAGTCTTTTA A
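
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the length or GC content is computed. The following is a minimal sketch of that clean-up and of a GC percentage calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input; paste the full block to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGAATTTT ATCTGGAACC CGCTCGTAAT ATACCTGTAC TGGCAACCAC GGAAGTGTTA"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Run on the full sequence, the cleaned length can be compared against the Gene Length field and the percentage against the GC content field.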
 
Protein sequence
MEFYLEPARN IPVLATTEVL VVGGGPSGIA AAMSAAREGA ATMLIERFGC FGGMMTTAGV 
ESIAWWRHEN TVESGGLARE IEETAKSMGA SSPEPQSNSQ AINAERFKLV ADAMLEQAGV
RRVLHITAVD VIKQGNNLLG VITESKSGRQ AILANVIIDC TGDADIAWFA GAPFIKRERE
ELMCMTTVFS CANINKNAFM QNIKSTEPKY GDWGADEENK NWSYDVHESC RDMFSPYLGK
VFAKGKSAGI IPKDVTLGGS WSTVTEYGDA NYLNVVSIPA VDCTDVFDLT RAEIEGRKQA
MQAIEALRQF QPGFEQAQLK NFGMTVGTRE SRHIIGRVQL TENDICNEGR HADSIGVFPE
FIDGNGHLKL PLEANYFQIP YGVMIPQQVE NLLVCGRAID ADNFAYATIR NMGCCIVTGE
GAGTAAAIAI KNNTTVSQVD IQTVQERLQQ NGVKVF