Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1118 |
Symbol | ansB |
ID | 6146805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1135375 |
End bp | 1136421 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615998 |
Product | L-asparaginase II |
Protein accession | YP_001743190 |
Protein GI | 170680917 |
COG category | [E] Amino acid transport and metabolism [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D |
TIGRFAM ID | [TIGR00520] L-asparaginases, type II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.881329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTCT ATAAAAGAAC GGCACTTGCC GCACTGGTTA TGGGTTTTAG TGGTGCAGCA CTGGCATTAC CCAATATCAC CATTTTAGCA ACCGGCGGGA CCATTGCCGG TGGTGGTGAC TCCGCAACCA AATCTAACTA CACAGCGGGT AAAGTTGGCG TAGAAAATCT GGTTAATGCG GTGCCGCAAC TGAAGGACAT TGCGAACGTT AAAGGCGAGC AGGTAGTGAA TATCGGCTCC CAGGACATGA ACGATAATGT CTGGCTGACA CTGGCGAAAA AAATTAACAC CGACTGCGAT AAAACCGACG GCTTCGTCAT TACCCACGGT ACCGACACGA TGGAAGAAAC CGCTTACTTC CTTGACCTGA CGGTGAAATG TGACAAACCG GTGGTGATGG TCGGTGCAAT GCGCCCGTCC ACGTCCATGA GCGCAGACGG TCCCTTCAAT CTGTATAACG CGGTAGTTAC CGCAGCTGAT AAAGCCTCCG CTAATCGTGG CGTACTGGTA GTAATGAACG ACACTGTTCT GGATGGCCGT GACGTCACTA AAACCAACAC CACCAACGTA GCGACCTTCA AGTCTGTTAA CTACGGCCCT CTGGGTTACA TTCACAACGG TAAGATTGAC TACCAGCGCA CCCCGGCACG TAAGCACACC AGTGATACGC CGTTCGATGT CTCTAAGCTG AATGAACTGC CGAAAGTCGG CATTGTTTAT AACTACGCTA ACGCATCCGA TCTTCCGGCT AAAGCACTGG TAGATGCGGG CTATGATGGC ATCGTTAGCG CTGGTGTGGG TAACGGTAAC CTGTATAAAT CCGTGTTTGA CACCCTGGCG ACCGCCGCGA AAAACGGCAC TGCGGTAGTG CGTTCTTCCC GCGTACCGAC GGGAGCAACC ACTCAGGATG CCGAAGTGGA TGATGCGAAA TATGGCTTCG TCGCCTCTGG CACGTTGAAC CCGCAAAAAG CACGTGTTCT GCTGCAACTG GCTCTGACGC AAACCAAAGA TCCGCAGCAG ATCCAGCAGA TATTCAATCA GTACTAA
|
Protein sequence | MNFYKRTALA ALVMGFSGAA LALPNITILA TGGTIAGGGD SATKSNYTAG KVGVENLVNA VPQLKDIANV KGEQVVNIGS QDMNDNVWLT LAKKINTDCD KTDGFVITHG TDTMEETAYF LDLTVKCDKP VVMVGAMRPS TSMSADGPFN LYNAVVTAAD KASANRGVLV VMNDTVLDGR DVTKTNTTNV ATFKSVNYGP LGYIHNGKID YQRTPARKHT SDTPFDVSKL NELPKVGIVY NYANASDLPA KALVDAGYDG IVSAGVGNGN LYKSVFDTLA TAAKNGTAVV RSSRVPTGAT TQDAEVDDAK YGFVASGTLN PQKARVLLQL ALTQTKDPQQ IQQIFNQY
|
| |