Gene Information |
Locus tag | EcSMS35_0554 |
Symbol | allB |
ID | 6144951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 562337 |
End bp | 563698 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615448 |
Product | allantoinase |
Protein accession | YP_001742655 |
Protein GI | 170682646 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase |
|
|
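The coordinate and length fields in the record above are related by simple arithmetic, which can serve as a quick sanity check. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 562337, End bp 563698
length = gene_length(562337, 563698)
print(length)                  # 1362
print(protein_length(length))  # 453
```

Both results agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields listed in the record.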
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTA GTGGATGTCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCAG GCATGGTTGA TGCGCACACC CATATTTCTG AACCGGGTCG CAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCA GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT GACCGCGCAT CGATTGAACT GAAGTTTGAT GCCGCCAAAG GCAAGCTGAC TATCGATGCG GCGCAACTCG GTGGCCTGGT GTCTTACAAC ATCGACCGTC TGCATGAGCT GGATGAAGTG GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC TTCCGTGACG TCAATGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGA CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ACGAACTTGG CGAAGAAGCG AAACGTGAAG GTCGCGTAAC CGCACATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA GTGGAAGCGA TTCGCCGCGT GCTGTACCTG GCGAAAGTTG CCGGTTGCCG TCTGCACGTT TGCCATATCA GCAGCCCGGA AGGTGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGCCAG GATGTTACCT GTGAATCCTG CCCGCATTAC TTTGTGCTGG ATACCGATCA GTTCGAAGAA ATTGGCACCC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG TGGGAAAAAC TGTTTAACGG TGAAATAGAC TGCCTGGTTT CCGACCACTC TCCATGCCCG CCGGAAATGA AAGCCGGTAA CATCATGAAA GCGTGGGGCG GTATCGCTGG TCTGCAAAGC TGCATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC GCCCCAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC TATTGGCGCG CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATCGAACA GGGCTTCCCT GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
|
Protein sequence | MSFDLIIKNG TVILENEARV VDVAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG QPVLVHCENA LICDELGEEA KREGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHV CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM WEKLFNGEID CLVSDHSPCP PEMKAGNIMK AWGGIAGLQS CMDVMFDEAV QKRGMSLPMF GKLMATNAAD IFGLQQKGRI APGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ
|
| |
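The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted sequence could be cleaned, translated, and checked for GC content is shown below. It builds the codon table by hand rather than using a library; translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs in the permitted start codons, which does not matter here because this gene begins with ATG. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record
print(translate("ATGTCTTTTG ATTTAATCAT TAAAAACGGC"))  # MSFDLIIKNG
```

The printed peptide matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.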