Gene ECH74115_4260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4260 
SymbolansB 
ID6968817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3945673 
End bp3946719 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content53% 
IMG OID643387998 
ProductL-asparaginase II 
Protein accessionYP_002272437 
Protein GI209397059 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00520] L-asparaginases, type II 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTT TCAAAAAGAC GGCACTTGCC GCACTGGTTA TGGGTTTTAG TGGTGCAGCA 
TTGGCATTAC CCAATATCAC CATTTTAGCA ACCGGCGGGA CCATTGCCGG TGGTGGTGAC
TCCGCAACCA AATCTAACTA CACAGCGGGT AAAGTTGGCG TAGAAAATCT GGTTAATGCG
GTGCCGCAAC TGAAGGACAT TGCGAACGTT AAAGGCGAGC AGGTAGTGAA TATCGGCTCC
CAGGACATGA ACGATGATGT CTGGCTGACA CTGGCGAAAA AAATTAACAC CGACTGCGAT
AAAACCGACG GCTTCGTCAT TACTCACGGT ACCGACACGA TGGAAGAAAC CGCTTACTTC
CTCGACCTGA CGGTGAAATG CGACAAACCG GTGGTGATGG TCGGCGCAAT GCGTCCGTCC
ACGTCTATGA GCGCAGACGG TCCATTCAAC CTGTATAACG CGGTAGTGAC CGCAGCTGAT
AAAGCCTCCG CTAATCGTGG CGTGCTGGTG GTGATGAACG ACACCGTACT GGACGGTCGC
GATGTCACCA AAACCAACAC CACCGACGTA GCGACCTTCA AGTCTGTTAA CTACGGTCCG
CTGGGATACA TTCACAACGG TAAGATTGAC TACCAACGTA CCCCGGCACG TAAGCACACC
AGCGACACGC CGTTCGATGT CTCTAAGCTG AATGAACTGC CGAAAGTCGG CATTGTTTAT
AACTACGCTA ACGCATCCGA TCTTCCGGCT AAAGCCCTGG TAGATGCGGG CTATGATGGC
ATCGTGAGCG CTGGTGTGGG TAACGGCAAC CTGTATAAAT CCGTGTTTGA CACGCTGGCG
ACCGCCGCGA AAAACGGTAC TGCAGTCGTG CGTTCTTCCC GCGTACCGAC GGGCGCTACC
ACTCAGGATG CCGAAGTGGA TGATGCGAAA TACGGTTTTA TTGCCTCTGG TACGCTGAAC
CCGCAAAAAG CGCGCGTCCT GCTGCAACTG GCTCTGACGC AAACCAAAGA TCCGCAGCAG
ATCCAGCAGA TCTTCAATCA GTACTAA
 
Protein sequence
MEFFKKTALA ALVMGFSGAA LALPNITILA TGGTIAGGGD SATKSNYTAG KVGVENLVNA 
VPQLKDIANV KGEQVVNIGS QDMNDDVWLT LAKKINTDCD KTDGFVITHG TDTMEETAYF
LDLTVKCDKP VVMVGAMRPS TSMSADGPFN LYNAVVTAAD KASANRGVLV VMNDTVLDGR
DVTKTNTTDV ATFKSVNYGP LGYIHNGKID YQRTPARKHT SDTPFDVSKL NELPKVGIVY
NYANASDLPA KALVDAGYDG IVSAGVGNGN LYKSVFDTLA TAAKNGTAVV RSSRVPTGAT
TQDAEVDDAK YGFIASGTLN PQKARVLLQL ALTQTKDPQQ IQQIFNQY