Gene EcE24377A_4024 details


Gene Information

Locus tag: EcE24377A_4024
Symbol:
ID: 5586001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4008387
End bp: 4009892
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 49%
IMG OID: 640927645
Product: hypothetical protein
Protein accession: YP_001465006
Protein GI: 157156797
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
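
The length fields above follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4008387
END_BP = 4009892


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1506 501"
```

Both values agree with the Gene Length and Protein Length fields of this record.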


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGCAG GCGGCGTCTG GTGGTTTAAC GTCGATCGCC ATGAAGATGC TATCAGTCTG 
GCGAATCAAA CAATTGCATC CCAGGCTGCA ACCGCACACG TCGCGGTCAT TAGCATGGAC
AGCGATCCGG CGAAAATCTT TCAATTAGAT GATTCTCAAG GGCCGGAAAA AATAAAATTA
TTTTCAATGC TAAATCATGA AAAAGGTCTA TACTATTTGA CCCGTGATTT GCAGTGTTCT
ATTGATCCCC ATAATTACCT TTTTATTCTT GTTTGCGCAA ATAACGCATG GCAAAACATT
CCTGCCGAGC GGCTTCGCTC ATGGTTGGAT AAAATGAATA AATGGAGCAG GTTAAACCAT
TGTTCGCTTT TGGTAATTAA TCCCGGAAAT AATAACGATA AACAATTTTC ATTGTTGCTT
GAGGAATACC GTTCACTTTT TGGTCTTGCC AGTTTGCGTT TTCAGGGTGA CCAACATTTG
CTGGATATTG CCTTCTGGTG CAACGAAAAA GGGGTCAGCG CCCGTCAGCA GCTTAGCGTT
CAGCAACAAA ATGGTATCTG GACATTAGTT CAAAGCGAAG AGGCGGAGAT CCAACCACGC
AGCGACGAAA AACGCATTCT GAGTAATGTT GCTGTACTGG AAGGTGCGCC GCCGCTATCG
GAACACTGGC AACTGTTCAA CAATAACGAA GTCCTGTTCA ATGAAGCCCG TACCGCTCAG
GCGGCGACGG TGGTCTTTTC TTTACAGCAA AATGCGCAAA TCGAGCCACT GGCCCGCAGC
ATTCATACCC TGCGTCGCCA GCGCGGTAGT GCGATGAAAA TCCTCGTGCG GGAAAATACC
GCTAGCCTGC GCGCCACCGA TGAACGTTTG TTATTGGCCT GCGGTGCAAA TATGGTTATT
CCGTGGAATG CGCCACTCTC CCGTTGTCTG ACGATGATCG AAAGCGTGCA AGGGCAGAAG
TTTAGTCGCT ATGTGCCGGA AGATATCACT ACCTTGCTGT CAATGACCCA GCCGCTCAAA
CTGCGTGGTT TCCAGAAGTG GGATGTGTTC TGTAATGCCG TCAACAACAT GATGAATAAC
CCTCTCTTGC CTGCCCACGG TAAAGGCGTT CTGGTTGCCC TACGTCCGGT ACCGGGTATC
CGCGTTGAAC AAGCCCTGAC GCTGTGTCGC CCTAACCGTA CCGGCGATAT CATGACCATT
GGCGGTAATC GGCTGGTGCT GTTTCTCTCA TTCTGTCGGA TTAACGATCT GGATACCGCG
TTGAATCATA TTTTCCCATT GCCTACTGGC GACATTTTCT CAAACCGTAT GGTCTGGTTT
GAAGATGATC AAATCAGTGC CGAGCTGGTG CAGATGCGCT TGCTTGCCCC AGAACAATGG
GGCATGCCGC TGCCTTTAAC GCAAAGTTCT AAACCGGTCA TCAATGCCGA GCACGATGGT
CGCCACTGGC GACGAATACC AGAACCCATG CGACTGTTAG ATGATGCTGT GGAGCGCTCA
TCATGA
 
Protein sequence
MPAGGVWWFN VDRHEDAISL ANQTIASQAA TAHVAVISMD SDPAKIFQLD DSQGPEKIKL 
FSMLNHEKGL YYLTRDLQCS IDPHNYLFIL VCANNAWQNI PAERLRSWLD KMNKWSRLNH
CSLLVINPGN NNDKQFSLLL EEYRSLFGLA SLRFQGDQHL LDIAFWCNEK GVSARQQLSV
QQQNGIWTLV QSEEAEIQPR SDEKRILSNV AVLEGAPPLS EHWQLFNNNE VLFNEARTAQ
AATVVFSLQQ NAQIEPLARS IHTLRRQRGS AMKILVRENT ASLRATDERL LLACGANMVI
PWNAPLSRCL TMIESVQGQK FSRYVPEDIT TLLSMTQPLK LRGFQKWDVF CNAVNNMMNN
PLLPAHGKGV LVALRPVPGI RVEQALTLCR PNRTGDIMTI GGNRLVLFLS FCRINDLDTA
LNHIFPLPTG DIFSNRMVWF EDDQISAELV QMRLLAPEQW GMPLPLTQSS KPVINAEHDG
RHWRRIPEPM RLLDDAVERS S
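
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 (which match the standard genetic code) and sequence text grouped in space-separated blocks of 10 as shown above. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: TAA, TAG or TGA
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
first_bases = "ATGCCAGCAG GCGGCGTCTG GTGGTTTAAC"
print(translate(first_bases))  # prints "MPAGGVWWFN"
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.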