Gene EcolC_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1865 
SymbolansA 
ID6064843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2064814 
End bp2065830 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content51% 
IMG OID641601278 
Productcytoplasmic asparaginase I 
Protein accessionYP_001724840 
Protein GI170019886 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00519] L-asparaginases, type I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.281005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000416053 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCAAAAGA AATCAATTTA CGTTGCCTAC ACGGGCGGGA CCATCGGGAT GCAGCGTTCC 
GAGCAGGGTT ATATACCGGT GTCAGGTCAT CTACAACGCC AACTGGCGCT GATGCCGGAA
TTCCATCGCC CGGAGATGCC AGATTTCACC ATTCATGAAT ATACGCCGCT GATGGATTCT
TCTGATATGA CGCCAGAAGA CTGGCAGCAT ATTGCTGAAG ATATTAAAGC GCACTATGAC
GACTATGATG GTTTTGTCAT TCTGCACGGC ACCGACACGA TGGCGTATAC CGCCTCTGCG
CTGTCGTTCA TGCTCGAAAA TCTCGGTAAA CCGGTCATTG TGACAGGGTC ACAAATCCCG
CTGGCTGAGT TACGCTCTGA CGGACAAATT AATCTGCTGA ATGCGTTGTA CGTTGCGGCG
AATTATCCGA TCAACGAAGT AACGCTCTTT TTTAATAATC GACTGTATCG CGGCAACCGC
ACCACCAAAG CCCATGCCGA TGGTTTTGAT GCGTTTGCCT CTCCAAACCT TCCTCCGTTA
CTGGAAGCAG GTATCCATAT ACGTCGTTTG AATACGCCAC CCGCCCCGCA CGGTGAAGGG
GAATTGATCG TTCATCCAAT CACGCCACAA CCAATTGGCG TAGTGACGAT TTATCCGGGG
ATTTCTGCTG ACGTCGTGCG CAATTTTCTG CGCCAACCGG TGAAAGCATT GATTCTGCGC
TCATATGGCG TGGGTAATGC GCCACAAAAC AAAGCCTTCC TGCAGGAATT ACAAGAAGCC
AGCGATCGCG GTATTGTGGT GGTCAACCTG ACACAATGTA TGTCCGGTAA AGTGAACATG
GGTGGTTATG CCACCGGTAA CGCCCTCGCC CATGCCGGCG TTATTGGCGG TGCAGATATG
ACTGTAGAAG CCACGCTAAC CAAACTGCAT TACCTGCTGA GCCAGGAACT GGATACTGAA
ACCATTCGCA AGGCCATGAG CCAAAACCTG CGCGGCGAAC TGACGCCGGA TGATTAA
 
Protein sequence
MQKKSIYVAY TGGTIGMQRS EQGYIPVSGH LQRQLALMPE FHRPEMPDFT IHEYTPLMDS 
SDMTPEDWQH IAEDIKAHYD DYDGFVILHG TDTMAYTASA LSFMLENLGK PVIVTGSQIP
LAELRSDGQI NLLNALYVAA NYPINEVTLF FNNRLYRGNR TTKAHADGFD AFASPNLPPL
LEAGIHIRRL NTPPAPHGEG ELIVHPITPQ PIGVVTIYPG ISADVVRNFL RQPVKALILR
SYGVGNAPQN KAFLQELQEA SDRGIVVVNL TQCMSGKVNM GGYATGNALA HAGVIGGADM
TVEATLTKLH YLLSQELDTE TIRKAMSQNL RGELTPDD