Gene EcSMS35_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1424 
SymbolansA 
ID6143005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1408437 
End bp1409453 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content50% 
IMG OID641616302 
Productcytoplasmic asparaginase I 
Protein accessionYP_001743482 
Protein GI170682503 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00519] L-asparaginases, type I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.202154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGA AATCAATTTA CGTCGCCTAC ACGGGCGGGA CCATCGGGAT GCAGCGTTCC 
GAGCAGGGTT ATATTCCGGT GTCAGGTCAT CTACAGCGCC AACTGGCGCT GATGCCGGAA
TTCCATCGCC CGGAGATGCC AGATTTCACC ATTCATGAAT ATACGCCGCT GATGGACTCT
TCAGATATGA CGCCAGAAGA CTGGCAGCAT ATTGCTGAAG ATATTAAAGC GCATTATGAC
GACTATGATG GTTTTGTCAT TCTGCACGGC ACCGACACGA TGGCGTATAC CGCCTCTGCA
CTGTCGTTCA TGCTCGAGAA TCTCGGTAAA CCGGTTATTG TGACAGGGTC ACAAATCCCG
CTAGCTGAGT TACGCTCTGA CGGACAAATT AATCTGCTGA ATGCGTTGTA CGTTGCGGCG
AATTATCCTA TCAACGAAGT AACTCTCTTT TTTAATAATC GACTTTATCG TGGCAACCGT
ACCACCAAAG CCCATGCCGA TGGTTTTGAT GCGTTTGCCT CTCCAAACCT TCCTCCGTTA
CTGGAAGCAG GTATCCATAT TCGTCGTTTG AATACGCCAC CCGCCCCGCA CGGTGAAGGG
GAATTGATCG TTCATCCAAT CACCCCACAA CCGATTGGCG TAGTGACGAT TTATCCGGGG
ATTTCTGCTG ACGTCGTGCG CAATTTTCTG CGCCAACCGG TGAAAGCATT GATTCTGCGC
TCATATGGCG TGGGTAATGC GCCACAAAAC AAAGCCTTCC TGCAAGAATT ACAAGAAGCC
AGCGATCGCG GAATTGTGGT AGTCAACCTG ACACAATGTA TGTCCGGTAA AGTGAACATG
GGGGGTTATG CCACCGGTAA CGCCCTCGCC CATGCCGGCG TCATTGGCGG TGCAGATATG
ACTGTAGAAG CCACACTAAC CAAACTGCAT TACCTGCTTA GCCAGGAACT GGATACTGAA
ACCATTCGCA AGGCCATGAG CCAAAACCTG CGTGGCGAAC TGACGCCGGA TGATTAA
 
Protein sequence
MQKKSIYVAY TGGTIGMQRS EQGYIPVSGH LQRQLALMPE FHRPEMPDFT IHEYTPLMDS 
SDMTPEDWQH IAEDIKAHYD DYDGFVILHG TDTMAYTASA LSFMLENLGK PVIVTGSQIP
LAELRSDGQI NLLNALYVAA NYPINEVTLF FNNRLYRGNR TTKAHADGFD AFASPNLPPL
LEAGIHIRRL NTPPAPHGEG ELIVHPITPQ PIGVVTIYPG ISADVVRNFL RQPVKALILR
SYGVGNAPQN KAFLQELQEA SDRGIVVVNL TQCMSGKVNM GGYATGNALA HAGVIGGADM
TVEATLTKLH YLLSQELDTE TIRKAMSQNL RGELTPDD