Gene EcSMS35_0693 details


Gene Information

Locus tag: EcSMS35_0693
Symbol: asnB
ID: 6143656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 697033
End bp: 698697
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 53%
IMG OID: 641615583
Product: asparagine synthetase B
Protein accession: YP_001742782
Protein GI: 170683596
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0367] Asparagine synthase (glutamine-hydrolyzing)
TIGRFAM ID: [TIGR01536] asparagine synthase (glutamine-hydrolyzing)
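
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, but its numbers are consistent with both. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(697033, 698697)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both results agree with the Gene Length and Protein Length fields of this record.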


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000880701
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTTCAA TTTTTGGCGT ATTCGATATC AAAACAGACG CAGTTGAGCT GCGGAAGAAA 
GCCCTCGAGC TGTCACGCCT GATGCGTCAT CGTGGCCCGG ACTGGTCCGG TATTTATGCC
AGCGATAACG CCATTCTCGC TCACGAACGT CTGTCAATTG TTGACGTTAA CGCAGGGGCG
CAACCTCTCT ACAACCAACA AAAAACCCAC GTACTGGCGG TAAACGGTGA AATCTACAAC
CACCAGGCAC TGCGCGCCGA ATATGGCGAT CGTTATCAGT TCCAGACCGG ATCTGACTGC
GAAGTGATCC TCGCGCTGTA TCAGGAAAAA GGGCCGGAAT TTCTTGACGA CTTGCAGGGC
ATGTTTGCCT TTGCCCTGTA CGACAGCGAA AAAGATGCCT ACCTGATTGG TCGCGACCAT
CTGGGGATCA TCCCACTGTA TATGGGCTAT GACGAACATG GTCAGCTGTA TGTGGCCTCA
GAAATGAAAG CCCTGGTGCC GGTATGCCGC ACGATTAAAG AGTTCCCGGC GGGGAGCTAT
TTGTGGAGTC AGGACGGCGA AATCCGTTCT TATTATCATC GCGACTGGTT CGACTACGAT
GCGGTGAAAG ATAACGTGAC CGACAAAAAC GAGCTGCGTC AGGCACTGGA AGATTCCGTT
AAAAGCCATC TGATGTCTGA TGTGCCTTAC GGTGTGCTGC TATCTGGTGG CCTGGATTCC
TCAATCATTT CCGCTATCAC CAAGAAATAC GCCGCCCGTC GCGTGGAAGA TCAGGAACGC
TCTGAAGCCT GGTGGCCGCA GCTGCACTCC TTTGCTGTAG GTCTGCCGGG TTCACCGGAT
CTGAAGGCAG CCCAGGAAGT GGCAAACCAT CTGGGCACTG TGCATCATGA AATTCACTTC
ACTGTACAGG AAGGTCTGGA CGCCATCCGC GACGTGATTT ACCACATCGA AACCTATGAC
GTGACGACAA TCCGCGCTTC AACACCGATG TATTTAATGT CGCGTAAGAT CAAGGCGATG
GGCATTAAAA TGGTGCTGTC CGGTGAAGGA TCTGATGAAG TGTTTGGCGG TTATCTTTAC
TTCCACAAAG CGCCGAATGC CAAAGAACTG CATGAAGAGA CGGTGCGTAA ACTGCTGGCC
CTGCATATGT ATGACTGCGC GCGTGCCAAC AAAGCGATGT CAGCCTGGGG CGTGGAAGCA
CGCGTTCCGT TCCTCGACAA AAAATTCCTC GACGTGGCGA TGCGCATTAA CCCACAGGAT
AAAATGTGCG GTAACGGCAA AATGGAAAAA CACATCCTGC GTGAATGTTT TGAAGCGTAT
CTGCCCGCAA GCGTGGCCTG GCGGCAGAAA GAGCAGTTCT CCGATGGCGT CGGTTACAGT
TGGATCGACA CCCTGAAAGA AGTGGCGGCG CAGCAGGTTT CTGATCAACA GCTGGAAACT
GCCCGCTTCC GCTTCCCGTA CAACACACCG ACCTCAAAAG AAGCGTATCT GTACCGGGAG
ATCTTTGAAG AACTGTTCCC GCTTCCGAGC GCCGCTGAGT GCGTGCCGGG CGGTCCTTCC
GTCGCGTGTT CTTCCGCTAA AGCGATTGAG TGGGATGAAG CGTTCAAGAA AATGGACGAT
CCGTCTGGTC GCGCGGTTGG TGTTCACCAG TCGGCATATA AATAA
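
The gene sequence is printed in blocks of 10 bases, 6 blocks per row, so the spacing has to be removed before the sequence is measured. The following is a minimal sketch of how the length and GC content could be recomputed from the pasted text; the helper names are hypothetical, and only the final row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final row of the gene sequence shown above.
last_row = "CCGTCTGGTC GCGCGGTTGG TGTTCACCAG TCGGCATATA AATAA"
print(len(clean_sequence(last_row)))  # 45
```

Passing the full gene sequence to `clean_sequence` and `gc_percent` in the same way allows the Gene Length and GC content fields to be checked against the sequence itself.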
 
Protein sequence
MCSIFGVFDI KTDAVELRKK ALELSRLMRH RGPDWSGIYA SDNAILAHER LSIVDVNAGA 
QPLYNQQKTH VLAVNGEIYN HQALRAEYGD RYQFQTGSDC EVILALYQEK GPEFLDDLQG
MFAFALYDSE KDAYLIGRDH LGIIPLYMGY DEHGQLYVAS EMKALVPVCR TIKEFPAGSY
LWSQDGEIRS YYHRDWFDYD AVKDNVTDKN ELRQALEDSV KSHLMSDVPY GVLLSGGLDS
SIISAITKKY AARRVEDQER SEAWWPQLHS FAVGLPGSPD LKAAQEVANH LGTVHHEIHF
TVQEGLDAIR DVIYHIETYD VTTIRASTPM YLMSRKIKAM GIKMVLSGEG SDEVFGGYLY
FHKAPNAKEL HEETVRKLLA LHMYDCARAN KAMSAWGVEA RVPFLDKKFL DVAMRINPQD
KMCGNGKMEK HILRECFEAY LPASVAWRQK EQFSDGVGYS WIDTLKEVAA QQVSDQQLET
ARFRFPYNTP TSKEAYLYRE IFEELFPLPS AAECVPGGPS VACSSAKAIE WDEAFKKMDD
PSGRAVGVHQ SAYK
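
The protein sequence can be compared with a translation of the gene sequence. The sketch below uses only the standard library and builds the standard codon assignments, which translation table 11 shares for internal codons; table 11 additionally permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that is not needed). It assumes the input is in frame and contains only A, C, G and T. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTGTTCAA TTTTTGGCGT ATTCGATATC"))  # MCSIFGVFDI
```

The output matches the first block of the protein sequence; translating the complete gene sequence in the same way gives a direct check of the whole record.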