Gene ECD_01044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01044 
SymbolmdoC 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1110351 
End bp1111508 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content44% 
IMG OID 
Productglucans biosynthesis protein 
Protein accessionACT42939 
Protein GI253977269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.314115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAG TACCCGCGCA ACGTGAATAT TTCCTCGACT CCATCCGCGC CTGGCTGATG 
TTGTTAGGGA TACCTTTTCA TATTTCTTTA ATCTATTCGA GCCATACATG GCATGTGAAT
AGCGCCGAAT CATCATTGTG GCTGACCCTT TTTAATGACT TCATCCACTC GTTCCGCATG
CAGGTATTTT TCGTTATATC CGGCTACTTT TCCTACATGC TTTTTTTACG CTATCCCTTG
AAAAAATGGT GGAAAGTACG TGTCGAACGT GTAGGTATCC CGATGTTAAC AGCCATCCCC
CTACTGACAT TACCGCAATT TATTATGCTG CAATATGTCA AAGGAAAAGC GGAAAGTTGG
CCTGGGCTGT CATTGTATGA CAAATATAAT ACGTTGGCCT GGGAATTAAT ATCACACCTG
TGGTTTTTAC TGGTGTTAGT GGTCATGACG ACGCTGTGCG TATGGATATT TAAGCGCATC
AGAAATAATT TAGAAAATTC TGATAAAACG AATAAAAAAT TCTCGATGGT AAAACTATCG
GTGATTTTTT TATGCCTCGG CATCGGTTAT GCGGTAATAA GAAGAACGAT TTTTATTGTG
TATCCGCCCA TTCTGAGTAA TGGCATGTTC AATTTTATTG TCATGCAAAC GCTGTTTTAT
TTGCCGTTCT TTATCCTCGG CGCACTGGCT TTCATTTTCC CTCATCTTAA AGCCTTGTTT
ACCACGCCGT CTCGTGGCTG TACCCTTGCA GCAGCATTGG CGTTTGTCGC TTATTTACTC
AACCAGCGCT ATGGCAGTGG CGATGCCTGG ATGTACGAAA CCGAGTCGGT GATCACCATG
GTCCTCGGTC TGTGGATGGT GAATGTGGTC TTCTCCTTTG GCCACCGTTT GCTTAACTTC
CAGTCAGCGC GGGTGACTTA TTTTGTTAAC GCATCGCTGT TTATCTATCT GGTTCACCAC
CCGTTAACGC TGTTTTTCGG CGCATACATT ACACCGCACA TCACCTCCAA CTGGCTTGGT
TTTCTCTGTG GCCTGATATT TGTAGTAGGG ATTGCGATAA TTCTGTATGA AATTCATTTG
CGCATCCCGT TACTGAAGTT TTTGTTCTCT GGTAAACCGG TTGTTAAGCG TGAGAACGAT
AAAGCACCAG CCCGTTAA
 
Protein sequence
MNPVPAQREY FLDSIRAWLM LLGIPFHISL IYSSHTWHVN SAESSLWLTL FNDFIHSFRM 
QVFFVISGYF SYMLFLRYPL KKWWKVRVER VGIPMLTAIP LLTLPQFIML QYVKGKAESW
PGLSLYDKYN TLAWELISHL WFLLVLVVMT TLCVWIFKRI RNNLENSDKT NKKFSMVKLS
VIFLCLGIGY AVIRRTIFIV YPPILSNGMF NFIVMQTLFY LPFFILGALA FIFPHLKALF
TTPSRGCTLA AALAFVAYLL NQRYGSGDAW MYETESVITM VLGLWMVNVV FSFGHRLLNF
QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHITSNWLG FLCGLIFVVG IAIILYEIHL
RIPLLKFLFS GKPVVKREND KAPAR