Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3111 |
Symbol | |
ID | 6066317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3407035 |
End bp | 3408396 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641602527 |
Product | allantoinase |
Protein accession | YP_001726061 |
Protein GI | 170021107 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type [TIGR03178] allantoinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.395546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT GTAGATATCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCGG GCATGGTTGA TGCGCACACC CATATTTCTG AACCGGGTCG TAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCG GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT GACCGTGCTT CAATTGAACT GAAGTTCGAT GCCGCTAAAG GCAAGCTGAC TATCGATGCG GCACAACTCG GTGGCCTGGT GTCTTACAAC ATTGATCGTC TGCATGAGTT GGATGAAGTG GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC TTCCGTGATG TAAACGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGG CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ATGCACTGGG CGAAGAAGCG AAAAGTGAAG GTCGCGTAAC TGCCCATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA GTGGAAGCGA TTCGCCGCGT ACTGTACCTG GCGAAAGTTG CCGGTTGCCG TCTGCACATT TGCCATATCA GCAGCCCAGA AGGTGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGTCAG GATGTTACTT GTGAATCCTG CCCGCATTAC TTTGTACTGG ATACCGATCA GTTCGAAGAA ATCGGTACTC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG TGGGAAAAAC TGTTTAACGG TGAAATCGAC TGCCTGGTTT CCGACCACTC ACCATGCCCT CCGGAAATGA AAGCCGGCAA CATCATGGAA GCATGGGGCG GTATTGCCGG TCTGCAAAAC TGTATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC GCCCCAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC CATTGGCGCG CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATTGAACA GGGCTTCCCT GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
|
Protein sequence | MSFDLIIKNG TVILENEARV VDIAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG QPVLVHCENA LICDALGEEA KSEGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHI CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM WEKLFNGEID CLVSDHSPCP PEMKAGNIME AWGGIAGLQN CMDVMFDEAV QKRGMSLPMF GKLMATNAAD IFGLQQKGRI APGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ
|
| |