Gene ECH74115_1427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1427 
SymbolmdoC 
ID6971015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1410512 
End bp1411669 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content44% 
IMG OID643385401 
Productglucans biosynthesis protein 
Protein accessionYP_002269895 
Protein GI209400526 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0273757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.379852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAG TACCCGCGCA ACGTGAATAT TTCCTCGACT CCATCCGCGC CTGGCTGATG 
TTGTTAGGGA TCCCTTTTCA TATTTCTTTA ATCTATTCGA GCCATACATG GCATGTGAAT
AGCGCCGAAC CGTCATTGTG GCTGACCCTT TTTAATGACT TCATCCACTC GTTCCGCATG
CAGGTATTTT TCGTTATATC CGGGTACTTT TCCTACATGC TTTTTTTACG CTATCCCTTG
AAAAAATGGT GGAAAGTACG TGTCGAACGT GTAGGGATCC CGATGTTAAC AGCCATCCCC
CTACTGACAT TGCCGCAATT TATTATGCTG CAATACGTCA AAGGGAAAGC GGAAAGTTGG
CCTGGGCTGT CATTGTATGA CAAATATAAT ACGTTGGCCT GGGAATTAAT ATCACATCTG
TGGTTTTTAC TGGTGTTAGT GGTCATGACG ACGCTGTGCG TATGGATATT TAAACGCATC
AGAAATAATT TAGAAAATTC TGATAAAACG AATAAAAAAT TCTCGATGGT AAAACTATCG
GTGATTTTTT TGTGCCTCGG CATCGGTTAT GCGGTAATAA GAAGAACGAT TTTTATTGTG
TATCCACCCA TTCTGAGTAA TGGCACGTTC AATTTTATTG TCATGCAAAC GCTATTTTAT
TTACCGTTCT TTATCCTCGG CGCACTGGCT TTCATTTTCC CTCATCTTAA AGCCTTGTTT
ACCACGCCGT CTCGTGGCTG TACCCTCGCT GCAGCATTGG CGTTTGTCGC TTATTTACTC
AACCAGCGCT ATGGCAGTGG CGATGCCTGG ATGTACGAAA CCGAGTCGGT GATCACCATG
GTCCTCGGTC TGTGGATGGT GAATGTGGTC TTCTCCTTTG GCCACCGTTT GCTTAACTTC
CAGTCAGCGC GGGTGACTTA TTTTGTTAAC GCATCGCTGT TTATCTATCT GGTTCACCAC
CCGTTAACGC TGTTTTTCGG CGCATACATT ACACCGCACA TCACCTCCAA CTGGCTTGGT
TTTCTCTGTG GCCTGATATT CGTAGTAGGG ATTGCGATAA TTCTGTATGA AATTCATTTG
CGCATCCCGT TACTGAAGTT TTTGTTCTCT GGTAAACCGG TTGTTAAGCG TGAGAACGAT
AAAGCACCAG CCCGTTAA
 
Protein sequence
MNPVPAQREY FLDSIRAWLM LLGIPFHISL IYSSHTWHVN SAEPSLWLTL FNDFIHSFRM 
QVFFVISGYF SYMLFLRYPL KKWWKVRVER VGIPMLTAIP LLTLPQFIML QYVKGKAESW
PGLSLYDKYN TLAWELISHL WFLLVLVVMT TLCVWIFKRI RNNLENSDKT NKKFSMVKLS
VIFLCLGIGY AVIRRTIFIV YPPILSNGTF NFIVMQTLFY LPFFILGALA FIFPHLKALF
TTPSRGCTLA AALAFVAYLL NQRYGSGDAW MYETESVITM VLGLWMVNVV FSFGHRLLNF
QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHITSNWLG FLCGLIFVVG IAIILYEIHL
RIPLLKFLFS GKPVVKREND KAPAR