Gene Information |
Locus tag | EcE24377A_0548 |
Symbol | allB |
ID | 5586304 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 574102 |
End bp | 575463 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924270 |
Product | allantoinase |
Protein accession | YP_001461697 |
Protein GI | 157155927 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase |
|
|
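The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(574102, 575463)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Under these assumptions the computed values match the Gene Length (1362 bp) and Protein Length (453 aa) fields.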
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT GTAGATATCG CCGTTAAAGG CGGGAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCGG GCATGGTTGA TGCGCACACC CATATTTCTG AACCGGGTCG TAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCG GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT GACCGCGCTT CAATTGAACT GAAGTTCGAT GCCGCTAAAG GCAAGCTGAC TATCGATGCG GCACAACTCG GTGGCCTGGT GTCTTACAAC ATCGACCGTC TGCATGAGCT GGATGAAGTG GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC TTCCGTGATG TAAACGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGG CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ACGCACTGGG CGAAGAAGCG AAACGTGAAG GTCGCGTAAC TGCCCATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA GTGGAAGCGA TTCGCCGCGT GCTGTATCTG GCGAAAGTTG CCGGTTGCCG TCTGCACGTT TGCCATATCA GCAGCCCGGA AGGCGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGTCAG GACGTTACTT GTGAATCCTG CCCGCATTAC TTTGTACTGG ATACCGATCA GTTCGAAGAA ATCGGCACTC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG TGGGAAAAAC TGTTTAACGG TGAAATCGAC TGCCTGGTTT CCGACCACTC ACCATGCCCT CCGGAAATGA AAGCCGGCAA CATCATGGAA GCATGGGGCG GTATTGCCGG CCTGCAAAAC TGTATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC GCCCTAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC CATTGGCGCG CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATTGAACA GGGCTTCCCT GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
|
Protein sequence | MSFDLIIKNG TVILENEARV VDIAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG QPVLVHCENA LICDALGEEA KREGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHV CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM WEKLFNGEID CLVSDHSPCP PEMKAGNIME AWGGIAGLQN CMDVMFDEAV QKRGMSLPMF GKLMATNAAD IFGLQQKGRI ALGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ
|
| |
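The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The sketch below shows one way to strip the spacing, compute GC content, and run a basic check that a sequence looks like a complete CDS. The helper names are hypothetical, the start-codon set covers only the three most common bacterial starts rather than every start permitted by translation table 11, and the example uses a short toy sequence rather than the full record.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are checked here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence has a start and a stop codon."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in COMMON_START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Toy input in the same block-formatted style as the record.
toy = "ATGGCC TAA"
print(normalize(toy))                # ATGGCCTAA
print(round(gc_percent(toy), 1))     # 44.4
print(looks_like_complete_cds(toy))  # True
```

Applied to the Gene sequence field, `normalize` should yield a single string that begins with ATG and ends with TAA, as the first and last blocks of the record show.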