Gene EcSMS35_4112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4112 
SymbolasnA 
ID6144321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4206954 
End bp4207946 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content56% 
IMG OID641618936 
Productasparagine synthetase AsnA 
Protein accessionYP_001746074 
Protein GI170683630 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2502] Asparagine synthetase A 
TIGRFAM ID[TIGR00669] aspartate--ammonia ligase, AsnA-type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0106172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.169697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG CTTACATTGC CAAACAACGT CAAATTAGCT TCGTGAAATC TCACTTTTCC 
CGTCAACTGG AAGAACGCTT AGGGCTGATT GAAGTCCAGG CGCCGATTCT TAGCCGTGTG
GGGGATGGCA CGCAGGATAA CTTGTCGGGC TGTGAAAAAG CGGTGCAGGT AAAAGTGAAA
GCTCTGCCTG ATGCCCAGTT CGAAGTGGTT CATTCACTGG CGAAGTGGAA ACGTCAGACA
TTAGGGCAAC ACGACTTCAG TGCGGGCGAA GGGCTTTACA CGCACATGAA AGCCCTTCGC
CCCGATGAAG ACCGTCTTTC TCCGTTGCAC TCGGTCTATG TTGACCAGTG GGACTGGGAA
CGCGTAATGG CCGACGGCGA GCGTCAATTC TCGACTCTGA AAAGCACGGT AGAGGCGATC
TGGGCGGGAA TTAAAGCAAC CGAAGCTGCG GTTAGCGAAG AGTTTGGCCT GGCACCGTTC
CTGCCGGATC AGATCCACTT CGTACACAGC CAGGAGTTAC TGTCTCGTTA TCCGAATCTT
GATGCCAAAG GGCGTGAGCG GGCGATTGCG AAAGATCTTG GCGCGGTATT TCTCGTCGGG
ATTGGCGGCA AGTTGAGTGA TGGCCATCGT CACGACGTGC GCGCACCGGA TTATGATGAC
TGGAGCACCC CGTCAGAGCT GGGCTATGCG GGGCTGAACG GCGATATTCT GGTGTGGAAC
CCGGTACTGG AAGATGCGTT TGAGCTTTCT TCCATGGGGA TCCGCGTTGA TGCCGACACG
CTGAAGCATC AGCTGGCGCT GACCGGTGAC GAAGATCGCC TGCAACTGGA GTGGCATCAG
GCGCTGCTGC GCGGTGAAAT GCCGCAGACC ATCGGCGGTG GTATCGGCCA GTCTCGTTTG
ACCATGCTGC TGCTGCAACT GCCGCATATC GGCCAGGTTC AGTGTGGAGT ATGGTCAGCG
GCAGTTCGTG AGAGCGTCCC TTCTCTGCTG TAA
 
Protein sequence
MKTAYIAKQR QISFVKSHFS RQLEERLGLI EVQAPILSRV GDGTQDNLSG CEKAVQVKVK 
ALPDAQFEVV HSLAKWKRQT LGQHDFSAGE GLYTHMKALR PDEDRLSPLH SVYVDQWDWE
RVMADGERQF STLKSTVEAI WAGIKATEAA VSEEFGLAPF LPDQIHFVHS QELLSRYPNL
DAKGRERAIA KDLGAVFLVG IGGKLSDGHR HDVRAPDYDD WSTPSELGYA GLNGDILVWN
PVLEDAFELS SMGIRVDADT LKHQLALTGD EDRLQLEWHQ ALLRGEMPQT IGGGIGQSRL
TMLLLQLPHI GQVQCGVWSA AVRESVPSLL