Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1424 |
Symbol | ansA |
ID | 6143005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1408437 |
End bp | 1409453 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616302 |
Product | cytoplasmic asparaginase I |
Protein accession | YP_001743482 |
Protein GI | 170682503 |
COG category | [E] Amino acid transport and metabolism [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D |
TIGRFAM ID | [TIGR00519] L-asparaginases, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.202154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGA AATCAATTTA CGTCGCCTAC ACGGGCGGGA CCATCGGGAT GCAGCGTTCC GAGCAGGGTT ATATTCCGGT GTCAGGTCAT CTACAGCGCC AACTGGCGCT GATGCCGGAA TTCCATCGCC CGGAGATGCC AGATTTCACC ATTCATGAAT ATACGCCGCT GATGGACTCT TCAGATATGA CGCCAGAAGA CTGGCAGCAT ATTGCTGAAG ATATTAAAGC GCATTATGAC GACTATGATG GTTTTGTCAT TCTGCACGGC ACCGACACGA TGGCGTATAC CGCCTCTGCA CTGTCGTTCA TGCTCGAGAA TCTCGGTAAA CCGGTTATTG TGACAGGGTC ACAAATCCCG CTAGCTGAGT TACGCTCTGA CGGACAAATT AATCTGCTGA ATGCGTTGTA CGTTGCGGCG AATTATCCTA TCAACGAAGT AACTCTCTTT TTTAATAATC GACTTTATCG TGGCAACCGT ACCACCAAAG CCCATGCCGA TGGTTTTGAT GCGTTTGCCT CTCCAAACCT TCCTCCGTTA CTGGAAGCAG GTATCCATAT TCGTCGTTTG AATACGCCAC CCGCCCCGCA CGGTGAAGGG GAATTGATCG TTCATCCAAT CACCCCACAA CCGATTGGCG TAGTGACGAT TTATCCGGGG ATTTCTGCTG ACGTCGTGCG CAATTTTCTG CGCCAACCGG TGAAAGCATT GATTCTGCGC TCATATGGCG TGGGTAATGC GCCACAAAAC AAAGCCTTCC TGCAAGAATT ACAAGAAGCC AGCGATCGCG GAATTGTGGT AGTCAACCTG ACACAATGTA TGTCCGGTAA AGTGAACATG GGGGGTTATG CCACCGGTAA CGCCCTCGCC CATGCCGGCG TCATTGGCGG TGCAGATATG ACTGTAGAAG CCACACTAAC CAAACTGCAT TACCTGCTTA GCCAGGAACT GGATACTGAA ACCATTCGCA AGGCCATGAG CCAAAACCTG CGTGGCGAAC TGACGCCGGA TGATTAA
|
Protein sequence | MQKKSIYVAY TGGTIGMQRS EQGYIPVSGH LQRQLALMPE FHRPEMPDFT IHEYTPLMDS SDMTPEDWQH IAEDIKAHYD DYDGFVILHG TDTMAYTASA LSFMLENLGK PVIVTGSQIP LAELRSDGQI NLLNALYVAA NYPINEVTLF FNNRLYRGNR TTKAHADGFD AFASPNLPPL LEAGIHIRRL NTPPAPHGEG ELIVHPITPQ PIGVVTIYPG ISADVVRNFL RQPVKALILR SYGVGNAPQN KAFLQELQEA SDRGIVVVNL TQCMSGKVNM GGYATGNALA HAGVIGGADM TVEATLTKLH YLLSQELDTE TIRKAMSQNL RGELTPDD
|
| |