Gene EcHS_A1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1851 
SymbolansA 
ID5591435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1868942 
End bp1869958 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content50% 
IMG OID640920995 
Productcytoplasmic asparaginase I 
Protein accessionYP_001458547 
Protein GI157161229 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00519] L-asparaginases, type I 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0116887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA AATCAATTTA CGTCGCCTAC ACGGGCGGGA CCATCGGGAT GCAGCGTTCC 
GAGCAGGGTT ATATTCCGGT GTCAGGTCAT CTACAACGCC AACTGGCGCT GATGCCGGAA
TTCCATCGCC CGGAGATGCC AGATTTCACC ATTCATGAAT ATACGCCGTT GATGGATTCT
TCTGATATGA CTCCAGAAGA CTGGCAGCAT ATTGCTGAAG ATATTAAAGC GCACTATGAC
GACTATGATG GTTTTGTCAT TCTGCACGGC ACCGATACGA TGGCATATAC CGCCTCTGCG
CTGTCGTTCA TGCTCGAGAA TCTCGGTAAA CCAGTCATTG TGACAGGGTC ACAAATCCCG
CTAGCTGAGT TACGCTCTGA CGGACAAATT AATCTGCTGA ATGCGTTGTA CGTTGCGGCG
AATTATCCGA TCAACGAAGT AACGCTCTTT TTCAATAACC GATTGTATCG CGGCAACCGC
ACTACCAAAG CCCATGCCGA TGGTTTTGAT GCGTTTGCCT CTCCAAACCT TCCTCCGTTA
CTGGAAGCAG GTATCCATAT TCGTCGTTTA AATACGCCAC CCGCCCCGCA CGGTGAAGGG
GAATTAATCG TTCATCCAAT CACCCCACAA CCAATTGGCG TAGTGACGAT TTATCCAGGG
ATTTCTGCTG ACGTCGTGCG CAATTTTCTG CGCCAACCGG TGAAAGCATT GATTTTGCGC
TCCTATGGCG TGGGTAATGC GCCACAAAAC AAAGCCTTCC TGCAGGAATT ACATGAAGCC
AGCGATCGCG GTATTGTGGT GGTCAACCTG ACACAATGTA TGTCCGGTAA AGTGAACATG
GGTGGTTATG CCACCGGTAA CGCCCTCGCC CATGCCGGCG TTATTGGCGG TGCAGATATG
ACTGTAGAAG CCACGCTAAC CAAACTGCAT TACCTGCTGA GCCAGGAACT GGATACTGAA
ACCATTCGCA AGGCCATGAG CCAAAACCTG CGCGGCGAAC TGACGCCGGA TGATTAA
 
Protein sequence
MQKKSIYVAY TGGTIGMQRS EQGYIPVSGH LQRQLALMPE FHRPEMPDFT IHEYTPLMDS 
SDMTPEDWQH IAEDIKAHYD DYDGFVILHG TDTMAYTASA LSFMLENLGK PVIVTGSQIP
LAELRSDGQI NLLNALYVAA NYPINEVTLF FNNRLYRGNR TTKAHADGFD AFASPNLPPL
LEAGIHIRRL NTPPAPHGEG ELIVHPITPQ PIGVVTIYPG ISADVVRNFL RQPVKALILR
SYGVGNAPQN KAFLQELHEA SDRGIVVVNL TQCMSGKVNM GGYATGNALA HAGVIGGADM
TVEATLTKLH YLLSQELDTE TIRKAMSQNL RGELTPDD