Gene EcE24377A_4938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4938 
Symbol 
ID5590314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4922870 
End bp4924150 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content52% 
IMG OID640928538 
Producthypothetical protein 
Protein accessionYP_001465865 
Protein GI157158478 
COG category[S] Function unknown 
COG ID[COG2733] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC 
GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG
AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG
CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT TCCGCGTAAT
AAAGACCGGA TTGGCGAAAA TCTCGGCCAG TTCGTGCAGG AAAAATTTCT CGATACCCAA
TCGCTGGTGG CATTGATTCG ACGTCACGAA CCGGCGTTGT TGATTGGCAA CTGGTTTAGT
CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTT
GAACTAACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA CCGGGCGATT
GATAAAGTCG ATCTTTCTGG CACCAGTGCG TTGATGCTGG AGAGTATGAC TAAAAACGAT
CGTCATCAGG TGCTGCTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT
AAATCGCGCA AGTTTATTGC CCAGCAAATT GTTCGCTGGC TGGAGAGCGA GCATCCACTG
AAAGCCAAAA TTTTGCCCAC TGAATGGCTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC
GCGGTGAATT CTTTGCTTGA TGATATCAGC CGCGATCGTG CGCATCAGAT CCGCCATGCG
TTTGATCGCG CTACCTTCGC CCTGATCGAG AAACTGAAAA ACGATCCGGA AATGGCGGCG
CGAGCCGATG CCGTAAAAAG CTATCTGAAA GAAGATGAAG CTTTTAACCG CTATCTCAGT
GAATTGTGGG GGGATTTACG GAAATGGCTG AAAGCGGATA TCAACAGTGA AGATTCTCGT
GTGAAAGAAC GTATCGCGCG GGCTGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT
GCCTTGCGGG CGTCGTTAAA TGGTCATCTG GAACAAGCCG CACACCGCAT TGCGCCTGAG
TTTTCCGCAT TCCTGACTCG CCATATCAGC GACACGGTAA AAAGCTGGGA CGCACGGGAC
ATGTCGCGGC AAATCGAGTT AAATATCGGC AAAGATCTCC AGTTTATCCG TGTCAACGGT
ACGCTGGTTG GCGGTTGTAT TGGACTTATT TTGTATTTGC TGTCGCAGCT CCCGGCCTTG
TTCCCCCTCG GCAATTTATA G
 
Protein sequence
MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA 
LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS
QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND
RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD
AVNSLLDDIS RDRAHQIRHA FDRATFALIE KLKNDPEMAA RADAVKSYLK EDEAFNRYLS
ELWGDLRKWL KADINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRIAPE
FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL
FPLGNL