Gene ECD_00431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00431 
SymbolushA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp473960 
End bp475612 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content51% 
IMG OID 
ProductUDP-sugar hydrolase 
Protein accessionACT42330 
Protein GI253976660 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.373208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCGCTGTTAA CCACATTTAC ACTGGCGAGT 
GAAACTGCTC TGGCGTATGA GCAGGATAAA ACCTACAAAA TTACAGTTCT GCATACCAAT
GATCATCATG GGCATTTTTG GCGCAATGAA TATGGCGAAT ATGGTCTGGC GGCGCAAAAA
ACGCTGGTGG ATGGTATCCG CAAAGAGGTT GCGGCTGAAG GCGGTAGCGT GCTGCTACTT
TCCGGTGGCG ACATTAACAC TGGCGTGCCC GAGTCTGACT TACAGGATGC CGAACCTGAT
TTTCGCGGTA TGAATCTGGT GGGCTATGAC GCGATGGCGA TCGGTAATCA TGAATTTGAT
AATCCGCTCA CCGTATTACG CCAGCAGGAA AAGTGGGCCA AGTTCCCGTT GCTTTCCGCG
AATATCTACC AGAAAAGTAC TGGCGAGCGC CTGTTTAAAC CGTGGGCGCT GTTTAAGCGT
CAGGATCTGA AAATTGCCGT TATTGGGCTG ACAACCGATG ACACAGCAAA AATTGGTAAC
CCGGAATACT TCACTGATAT CGAATTTCGT AAGCCCGCCG ATGAAGCGAA GCTGGTGATT
CAGGAGCTGC AACAGACAGA AAAGCCAGAC ATTATTATCG CGGCGACCCA TATGGGGCAT
TACGATAATG GTGAGCACGG CTCTAACGCA CCGGGCGATG TGGAGATGGC ACGCGCGCTG
CCTGCCGGAT CGCTGGCGAT GATCGTCGGT GGTCACTCGC AAGATCCGGT CTGCATGGCG
GCAGAAAACA AAAAACAGGT CGATTACGTG CCGGGTACGC CATGCAAACC AGATCAACAA
AACGGCATCT GGATTGTGCA GGCGCATGAG TGGGGCAAAT ACGTGGGACG GGCTGATTTT
GAGTTTCGTA ATGGCGAAAT GAAAATGGTT AACTACCAGC TGATTCCGGT GAACCTGAAG
AAGAAAGTGA CCTGGGAAGA CGGGAAAAGC GAGCGCGTGC TTTACACTCC TGAAATCGCT
GAAAACCAGC AAATGATCTC GCTGTTATCA CCGTTCCAGA ACAAAGGCAA AGCGCAGCTG
GAAGTGAAAA TAGGCGAAAC CAATGGTCGT CTGGAAGGCG ATCGTGACAA AGTGCGTTTT
GTACAGACCA ATATGGGGCG GTTGATTCTG GCAGCCCAAA TGGATCGCAC TGGTGCCGAC
TTTGCGGTGA TGAGCGGAGG CGGAATTCGT GATTCTATCG AAGCAGGCGA TATCAGCTAT
AAAAACGTGC TGAAAGTGCA GCCATTCGGC AATGTGGTGG TGTATGCCGA CATGACCGGT
AAAGAGGTGA TTGATTACCT GACCGCCGTC GCGCAGATGA AGCCAGATTC AGGTGCCTAC
CCGCAATTTG CCAACGTTAG CTTTGTGGCG AAAGACGGCA AACTGAACGA CCTTAAAATC
AAAGGCGAAC CGGTCGATCC GGCGAAAACT TACCGTATGG CGACATTAAA CTTCAATGCC
ACCGGCGGTG ATGGATATCC GCGCCTTGAT AACAAACCGG GCTATGTGAA TACCGGCTTT
ATTGATGCCG AAGTGCTGAA AGCGTATATC CAGAAAAGCT CGCCGCTGGA TGTGAGTGTT
TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA
 
Protein sequence
MKLLQRGVAL ALLTTFTLAS ETALAYEQDK TYKITVLHTN DHHGHFWRNE YGEYGLAAQK 
TLVDGIRKEV AAEGGSVLLL SGGDINTGVP ESDLQDAEPD FRGMNLVGYD AMAIGNHEFD
NPLTVLRQQE KWAKFPLLSA NIYQKSTGER LFKPWALFKR QDLKIAVIGL TTDDTAKIGN
PEYFTDIEFR KPADEAKLVI QELQQTEKPD IIIAATHMGH YDNGEHGSNA PGDVEMARAL
PAGSLAMIVG GHSQDPVCMA AENKKQVDYV PGTPCKPDQQ NGIWIVQAHE WGKYVGRADF
EFRNGEMKMV NYQLIPVNLK KKVTWEDGKS ERVLYTPEIA ENQQMISLLS PFQNKGKAQL
EVKIGETNGR LEGDRDKVRF VQTNMGRLIL AAQMDRTGAD FAVMSGGGIR DSIEAGDISY
KNVLKVQPFG NVVVYADMTG KEVIDYLTAV AQMKPDSGAY PQFANVSFVA KDGKLNDLKI
KGEPVDPAKT YRMATLNFNA TGGDGYPRLD NKPGYVNTGF IDAEVLKAYI QKSSPLDVSV
YEPKGEVSWQ