Gene ECH74115_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1091 
SymbolasnC 
ID6968823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1120412 
End bp1121812 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content53% 
IMG OID643385103 
Productasparaginyl-tRNA synthetase 
Protein accessionYP_002269602 
Protein GI209397026 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0017] Aspartyl/asparaginyl-tRNA synthetases 
TIGRFAM ID[TIGR00457] asparaginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000126096 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.367452 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTG TGCCTGTAGC CGACGTACTC CAGGGCCGCG TAGCCGTTGA CAGCGAAGTC 
ACCGTACGCG GATGGGTACG TACCCGCCGA GATTCAAAAG CTGGCATCTC CTTCCTCGCC
GTTTATGACG GTTCCTGCTT TGATCCTGTA CAGGCTGTCA TCAATAATTC TCTGCCCAAT
TACAATGAAG ACGTCCTGCG TCTGACCACC GGCTGCTCGG TCATTGTGAC GGGTAAAGTC
GTGGCGTCGC CGGGCCAGGG GCAACAATTT GAAATTCAGA CCAGCAAGGT TGAAGTTGCT
GGTTGGGTTG AAGATCCAGA CACTTACCCG ATGGCGGCAA AACGCCACAG CATTGAGTAT
CTGCGTGAAG TCGCTCACCT GCGTCCGCGC ACAAACCTGA TTGGTGCCGT CGCGCGCGTT
CGCCATACGC TGGCGCAGGC GCTGCATCGC TTCTTTAACG AGCAGGGATT CTTCTGGGTT
TCAACGCCAC TGATTACTGC GTCTGATACC GAAGGGGCTG GCGAAATGTT CCGCGTTTCT
ACGCTGGATC TGGAAAATCT GCCGCGTAAC GATCAGGGCA AAGTGGATTT CGACAAAGAC
TTCTTTGGTA AAGAGTCTTT CCTGACCGTA TCTGGCCAGT TGAACGGCGA AACCTACGCT
TGCGCATTAT CCAAAATTTA TACCTTCGGC CCGACTTTCC GTGCTGAAAA CTCCAACACC
AGCCGTCACC TGGCGGAATT CTGGATGCTG GAGCCGGAAG TGGCGTTTGC TAACCTGAAC
GATATCGCGG GTCTGGCTGA AGCCATGCTG AAATATGTCT TCAAAGCGGT TCTCGAAGAA
CGCGCTGACG ACATGAAATT CTTCGCTGAA CGCGTAGATA AAGATGCCGT TTCACGTCTG
GAACGCTTTA TTGAGGCCGA TTTCGCGCAG GTGGATTACA CCGAAGCAGT AACCATTCTC
GAAAACTGCG GCAGGAAGTT TGAAAACCCG GTTTACTGGG GCGTCGATCT CTCTTCTGAG
CATGAGCGTT ATCTGGCGGA AGAACACTTT AAAGCACCGG TAGTGGTTAA AAACTATCCG
AAAGATATTA AAGCGTTCTA TATGCGCCTC AACGAAGACG GTAAAACCGT TGCGGCTATG
GATGTTCTGG CTCCGGGCAT CGGTGAGATC ATTGGTGGCT CCCAGCGTGA AGAGCGTCTG
GACGTGCTGG ACGAGCGTAT GCTGGAAATG GGCCTGAACA AAGAAGATTA CTGGTGGTAT
CGCGATCTGC GTCGCTACGG TACTGTTCCG CATTCCGGTT TCGGTCTTGG TTTTGAACGC
CTGATTGCTT ACGTAACTGG TGTGCAAAAC GTGCGTGATG TGATTCCGTT CCCACGAACT
CCGCGTAACG CCAGCTTCTA A
 
Protein sequence
MSVVPVADVL QGRVAVDSEV TVRGWVRTRR DSKAGISFLA VYDGSCFDPV QAVINNSLPN 
YNEDVLRLTT GCSVIVTGKV VASPGQGQQF EIQTSKVEVA GWVEDPDTYP MAAKRHSIEY
LREVAHLRPR TNLIGAVARV RHTLAQALHR FFNEQGFFWV STPLITASDT EGAGEMFRVS
TLDLENLPRN DQGKVDFDKD FFGKESFLTV SGQLNGETYA CALSKIYTFG PTFRAENSNT
SRHLAEFWML EPEVAFANLN DIAGLAEAML KYVFKAVLEE RADDMKFFAE RVDKDAVSRL
ERFIEADFAQ VDYTEAVTIL ENCGRKFENP VYWGVDLSSE HERYLAEEHF KAPVVVKNYP
KDIKAFYMRL NEDGKTVAAM DVLAPGIGEI IGGSQREERL DVLDERMLEM GLNKEDYWWY
RDLRRYGTVP HSGFGLGFER LIAYVTGVQN VRDVIPFPRT PRNASF