Gene EcE24377A_3330 details


Gene Information

Locus tag: EcE24377A_3330
Symbol:
ID: 5586991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3345110
End bp: 3346396
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 45%
IMG OID: 640926964
Product: hypothetical protein
Protein accession: YP_001464335
Protein GI: 157155806
COG category: [V] Defense mechanisms
COG ID: [COG4268] McrBC 5-methylcytosine restriction system component
TIGRFAM ID:
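
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS.

    Assumes the gene length is a whole number of codons and that the
    last codon is a stop codon, which is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3345110, 3346396)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.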


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAG TTATCTCAAT TTTTGAATAT GACCTGTTGG GGAGTGGAAA AGCGGCTTCA 
GTTGGCGCAA AGTTGGTGCC TCAACAGGTC TTTGATTATT TAGAAACACT TAGCCTAACA
AGTGAGCAAG GAAGCCAGTT TCTCAAACTC ACCTCTCGTT CAGGTTTCAA ATTGCTGCAG
GTACAAAACT ACGCGGGGAT GCTTTCTACA CCTCACGGTT TGCAGCTTGA GATACTGCCC
AAGATTGGTA AAAACCTCAC GCTTGCGAGT GCGCGAGAAA CATTCATTAC GATGCTAAGT
CACCTACCTA GGTTTCGACA CATTCAAACC CAGCAGGCCA CTCTTCAGGC GCAACGCATG
CCTTTGCTCG AGATTTTTAT CAGCCAATTT TTGCAAAGCG TCAGTCAATT GCTTAAACAA
GGTTTACGCT CCGACTATGT GAGTGAGAAA GGTAATCTGG CTTTTATGAA GGGCAAACTG
ATGCTCTCTG CACAACTGCG ACATAACGCG GTGAATCGCC ATAAGTTTTG TGTCGATTAT
GATGAGTATA TGCCTGATTG TGCTGCCAAT CGGCTCTTAC ACTCCACACT GGATAAGTTA
CTTAGTCTGA AGCTGTCATC AGAGAATCAA CGCTGGCTTT ACGAACTGTG CTTTGCTTTT
GACGGTATTC CACTTAGTAG GGATATAGAG AGCGATCTGA ACAGCTTACG CATTGAGCGT
GGTATGACTC ATTACAGCGA GCCAATAGCT TGGGCGCAGT TGATCCTGAG AGGAATGAGC
CCAAGTGCAT TGCAAGGAAA CACCAAAGCG ATATCACTTT TGTTTCCTAT GGAAGCGGTA
TTTGAATCCT TTGTGGCACA GACCCTACCC TACGAATTAC CTTCTCACCT AAAAGTTTTT
TCTCAAGCAG CAACGTATTC TTTGGTAAAG CATGGACTCA AAGATTGCTT TAAGCTTCGC
CCAGACTTGC TGATTCAATC TCGGCAACCG ATTCAAACCA AAATGGTGAT GGATACAAAA
TGGAAGCAGG TGAATAGCAG CCAGCAAAAA AAATCACTTT ATGGGCTAGC GCAATCCGAC
TTCTATCAAA TGTTTGCCTA CGGCCAAAAA TACCTTGGCG GAACAGGCGA AATGTACCTG
ATTTACCCTG CGCATGATGA CTTTAGCCAA CCGATACCGC AGCACTTTGC TTTTTCAGAG
ACTTTAAAAT TATGGGTTGT GCCATACCGG ATAATGGCAA AACGTGGTGA GAGGATGATG
TGGGCAAGTG ATGTGTTAGC TACATAG
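
A GC content figure like the one in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks like the one above. The sketch below shows one way to do this. The helper name `gc_content` is hypothetical, and the example runs on the first 10-base block only, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to lay out the
    sequence in blocks of 10) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First 10-base block of the gene sequence above.
first_block = "ATGAGTGAAG"
print(f"{gc_content(first_block):.0%}")  # 40%
```

Passing the complete gene sequence to the same function would give the gene-wide fraction for comparison with the reported GC content.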
 
Protein sequence
MSEVISIFEY DLLGSGKAAS VGAKLVPQQV FDYLETLSLT SEQGSQFLKL TSRSGFKLLQ 
VQNYAGMLST PHGLQLEILP KIGKNLTLAS ARETFITMLS HLPRFRHIQT QQATLQAQRM
PLLEIFISQF LQSVSQLLKQ GLRSDYVSEK GNLAFMKGKL MLSAQLRHNA VNRHKFCVDY
DEYMPDCAAN RLLHSTLDKL LSLKLSSENQ RWLYELCFAF DGIPLSRDIE SDLNSLRIER
GMTHYSEPIA WAQLILRGMS PSALQGNTKA ISLLFPMEAV FESFVAQTLP YELPSHLKVF
SQAATYSLVK HGLKDCFKLR PDLLIQSRQP IQTKMVMDTK WKQVNSSQQK KSLYGLAQSD
FYQMFAYGQK YLGGTGEMYL IYPAHDDFSQ PIPQHFAFSE TLKLWVVPYR IMAKRGERMM
WASDVLAT
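
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene, assuming the standard amino acid assignments. To the best of my knowledge, translation table 11 uses the same assignments and differs from the standard code only in which start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments ("*" marks a stop codon). Assumed to match table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence up to the first stop codon.

    Whitespace is ignored. Bases other than A, C, G, T raise KeyError.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence.
print(translate("ATGAGTGAAG TTATCTCAAT TTTTGAATAT"))  # MSEVISIFEY
print(translate("TGGGCAAGTG ATGTGTTAGC TACATAG"))     # WASDVLAT
```

The two outputs match the first and last blocks of the protein sequence in the record. The final fragment is in frame because the 1260 bases before it are a whole number of codons.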