Gene EcHS_A3117 details

Gene Information

Locus tag: EcHS_A3117
Symbol: ansB
ID: 5593714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3128600
End bp: 3129646
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 53%
IMG OID: 640922236
Product: L-asparaginase II
Protein accession: YP_001459736
Protein GI: 157162418
COG category: [E] Amino acid transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D
TIGRFAM ID: [TIGR00520] L-asparaginases, type II
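
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which agrees with the listed 1047 bp) and that the unspliced CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(3128600, 3129646)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both values match the Gene Length and Protein Length fields of the record.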


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTTTT TCAAAAAGAC GGCACTTGCC GCACTGGTTA TGGGTTTTAG TGGTGCAGCA 
TTGGCATTAC CCAATATCAC CATTTTAGCA ACCGGCGGGA CCATTGCCGG TGGTGGTGAC
TCCGCAACCA AATCTAACTA CACAGCGGGT AAAGTTGGCG TAGAAAATCT GGTTAATGCG
GTGCCGCAAC TAAAAGACAT TGCGAACGTT AAAGGCGAGC AGGTAGTGAA TATCGGCTCC
CAGGACATGA ACGATAATGT CTGGCTGACA CTGGCGAAAA AAATTAACAC CGACTGCGAT
AAAACCGACG GCTTCGTCAT TACCCACGGT ACCGACACGA TGGAAGAAAC CGCTTACTTC
CTCGACCTGA CGGTGAAATG CGACAAACCG GTGGTGATGG TCGGCGCAAT GCGCCCGTCC
ACGTCCATGA GCGCAGACGG TCCATTCAAC CTGTATAACG CGGTAGTGAC CGCAGCTGAT
AAAGCCTCCG CTAATCGTGG CGTGCTGGTG GTGATGAACG ACACCGTACT GGACGGTCGC
GATGTCACCA AAACCAACAC CACCGACGTA GCGACCTTCA AGTCTGTTAA CTACGGTCCT
CTGGGATACA TTCACAACGG TAAGATTGAC TACCAACGTA CCCCGGCACG TAAGCACACC
AGCGATACGC CATTCGATGT CTCTAAGCTG AATGAGCTGC CGAAAGTCGG CATCGTTTAT
AACTACGCTA ACGCATCCGA TCTTCCGGCT AAAGCACTGG TAGATGCGGG CTATGATGGC
ATCGTTAGCG CTGGTGTGGG TAATGGTAAC CTGTATAAAT CCGTGTTCGA CACCCTGGCA
ACCGCCGCGA AAAACGGCAC TGCAGTAGTG CGTTCTTCCC GCGTACCGAC GGGTGCTACC
ACTCAGGATG CTGAAGTGGA TGATGCGAAA TACGGCTTCG TCGCCTCTGG CACGCTGAAC
CCGCAAAAAG CGCGCGTCCT GCTGCAGCTG GCTCTGACGC AAACCAAAGA TCCGCAGCAG
ATCCAGCAGA TCTTCAATCA GTACTAA
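
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the record. The sketch below is one possible way to do that. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed value is the length of that line, not of the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGAGTTTT TCAAAAAGAC GGCACTTGCC GCACTGGTTA TGGGTTTTAG TGGTGCAGCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full sequence text to clean_sequence and then to gc_percent gives a figure that can be compared with the GC content field.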
 
Protein sequence
MEFFKKTALA ALVMGFSGAA LALPNITILA TGGTIAGGGD SATKSNYTAG KVGVENLVNA 
VPQLKDIANV KGEQVVNIGS QDMNDNVWLT LAKKINTDCD KTDGFVITHG TDTMEETAYF
LDLTVKCDKP VVMVGAMRPS TSMSADGPFN LYNAVVTAAD KASANRGVLV VMNDTVLDGR
DVTKTNTTDV ATFKSVNYGP LGYIHNGKID YQRTPARKHT SDTPFDVSKL NELPKVGIVY
NYANASDLPA KALVDAGYDG IVSAGVGNGN LYKSVFDTLA TAAKNGTAVV RSSRVPTGAT
TQDAEVDDAK YGFVASGTLN PQKARVLLQL ALTQTKDPQQ IQQIFNQY
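
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table rather than a library call. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAGTTTT TCAAAAAGAC GGCACTTGCC"))  # MEFFKKTALA
```

The final nine bases of the gene, CAGTACTAA, translate to QY followed by a stop codon, which agrees with the end of the protein sequence.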