Gene EcE24377A_3135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3135 
Symbol 
ID5586337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3142113 
End bp3143489 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content55% 
IMG OID640926777 
ProductImpA-related N-terminal domain-containing protein 
Protein accessionYP_001464150 
Protein GI157156697 
COG category[S] Function unknown 
COG ID[COG3515] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTA ACGCGAATTT TATCAGCCAG TTCGTCATGG GCGGCGATCC CTGTACTTAT 
AAGGAATCCG GTGAACTACA GGCTGAAATG AGTAAACTGA CTCACCCGGC CCGACCTGAT
GTGGACTGGC GTCGGGTGGA AAAACTCAGC CTCGCGCTGT TCCGGCAAAA TGGCGTGGAA
TTACAGACGC AGGTCTGTTA CGTACTGGCG ATAACCAGAC GGCAGGGGCT GGCAGGGATG
GCAGACGGAC TCGGTTCACT GGACATACTG CTCCAGCGCT GGGCTGACTT CTGGCCGGTA
CAGGTACATT CCAGAATATC ACTGCTCAGT TGGGTCACAG AAAAAATGCA GCAGGCACTG
AGAACGCTGG ATATTCAGTA TCAGGATCTG CCGCAGATTT ACCGTTGTGT ACAGCATCTT
TCCGCCATCG AAACCACGCT GCAACAGTGT GAACTGTGGC ATATGACGAA ACTGGATGTG
CTGGCCGGGC AGTTTCGTAA TACGGCGCTA CGTCTGGAGC GGCTGGCACC ACAGGGAGCG
GAAACCACTA TCACTCCCCC TGAATTACCC CGCCGGGAAA TGAAGCAACC AGAAAAGTCA
GAAGAAAGTC CACAGCCGGT TTTTGCAACC AGACCCGCTC AGCAAAACGA TAAGGATGCC
AGTCCATCCG CGCCATCCCC TGAAATCTCC CGGCAGCGGA CATGGCCGAT ATTTATGGCC
GGAATGGTTG TGATGGCCTG TCTCGGCGGA ACAGGATTAT GGGGCTGGTC GCAGCTTAAT
CAGCCGGACG CACTAATCCA GCGAATACAA CTGTCTGTCA TGCCATTACC GCAGTCGCTG
GAGAGCGGCG AGCTGGCAAA GCTGGATGTA AAGGATAAGG CGCTGCTGGC TCAGGACAGA
ACAATTGCGG CAAGTCAGAT GCAACTGGAG CAGTTAAACA AATTGCCTGC CCGCTGGCCA
CTGGAGCAGG GATATCGCCA GCTACGCCAG CTTGATGCCC TGTGGCCGGA TAATCCTCAG
GTCAGAGCGC TGAACGCGCA GTGGCGTAAA CAGCGGGAGC TGAGCGCCCT GTCCACAGAG
GCATTGAATG GCTACGCACA GGCGCAGAGC CAGCTACAGC GCCTGTCGGC GCAGCTGGAT
GCACTGGATG AGCGTAAGGG GAGATATCTG ACCGGTTCGG AACTAAAAAC GGCGGTGTAC
GGCATCCGGC AGTCGTTAAA GGAGCCGCCG CTGGAAGAAC TGCTTCGGCA ACTGGAAGAG
CAAAAACAGA CCGGAGAGGT TTCGCCGACG CTGTTGACGC AAATTGATAC CCGGTTAAAT
CAGTTGCTGA ATCGCTATGT AATTTTACTG GATACGAAGG TGGAACAAAG TCAGTAA
 
Protein sequence
MASNANFISQ FVMGGDPCTY KESGELQAEM SKLTHPARPD VDWRRVEKLS LALFRQNGVE 
LQTQVCYVLA ITRRQGLAGM ADGLGSLDIL LQRWADFWPV QVHSRISLLS WVTEKMQQAL
RTLDIQYQDL PQIYRCVQHL SAIETTLQQC ELWHMTKLDV LAGQFRNTAL RLERLAPQGA
ETTITPPELP RREMKQPEKS EESPQPVFAT RPAQQNDKDA SPSAPSPEIS RQRTWPIFMA
GMVVMACLGG TGLWGWSQLN QPDALIQRIQ LSVMPLPQSL ESGELAKLDV KDKALLAQDR
TIAASQMQLE QLNKLPARWP LEQGYRQLRQ LDALWPDNPQ VRALNAQWRK QRELSALSTE
ALNGYAQAQS QLQRLSAQLD ALDERKGRYL TGSELKTAVY GIRQSLKEPP LEELLRQLEE
QKQTGEVSPT LLTQIDTRLN QLLNRYVILL DTKVEQSQ