Gene EcHS_A1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1038 
SymbolasnC 
ID5591641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1049201 
End bp1050601 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content53% 
IMG OID640920205 
Productasparaginyl-tRNA synthetase 
Protein accessionYP_001457770 
Protein GI157160452 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0017] Aspartyl/asparaginyl-tRNA synthetases 
TIGRFAM ID[TIGR00457] asparaginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0000136907 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTTG TGCCTGTAGC CGACGTACTC CAGGGCCGCG TAGCCGTTGA CAGCGAAGTC 
ACCGTACGCG GATGGGTACG TACCCGCCGA GATTCAAAAG CTGGCATCTC CTTCCTCGCC
GTTTATGACG GTTCCTGCTT TGATCCTGTA CAGGCTGTCA TCAATAATTC TCTGCCCAAT
TACAATGAAG ACGTCCTGCG TCTGACCACC GGTTGCTCGG TCATTGTGAC GGGTAAAGTC
GTGGCGTCGC CGGGCCAGGG GCAACAATTT GAAATTCAGG CCAGCAAGGT TGAAGTTGCT
GGTTGGGTTG AAGATCCAGA CACTTACCCG ATGGCGGCAA AACGCCACAG CATTGAGTAT
CTGCGTGAAG TCGCTCACCT GCGTCCGCGC ACAAACCTGA TTGGTGCCGT CGCGCGCGTT
CGCCATACGC TGGCGCAGGC GCTGCATCGC TTCTTTAACG AGCAGGGATT CTTCTGGGTT
TCAACGCCAC TGATTACTGC GTCTGATACC GAAGGGGCTG GCGAAATGTT CCGCGTTTCT
ACGCTGGATC TGGAAAATCT GCCGCGTAAC GATCAGGGCA AAGTGGATTT CGACAAAGAC
TTCTTTGGTA AAGAGTCTTT CCTGACCGTA TCTGGCCAGT TGAACGGCGA AACCTACGCT
TGCGCATTAT CCAAAATTTA TACCTTCGGC CCGACTTTCC GTGCTGAAAA CTCCAACACC
AGCCGTCACC TGGCGGAGTT CTGGATGCTG GAGCCGGAAG TGGCGTTTGC TAACCTGAAC
GATATCGCGG GTCTGGCTGA AGCCATGCTG AAATATGTCT TCAAAGCGGT TCTCGAAGAA
CGCGCAGACG ACATGAAATT CTTCGCTGAA CGCGTAGATA AAGATGCCGT TTCACGTCTG
GAACGCTTCA TTGAAGCCGA TTTTGCGCAG GTGGATTACA CCGACGCAGT GACCATTCTC
GAAAACTGCG GCAGGAAGTT TGAAAACCCA GTTTACTGGG GCGTTGATCT CTCTTCTGAG
CATGAGCGTT ATCTGGCGGA AGAACACTTT AAAGCACCGG TAGTGGTTAA AAACTATCCG
AAAGATATTA AAGCGTTCTA TATGCGCCTT AACGAAGACG GTAAAACCGT TGCGGCTATG
GACGTTCTGG CTCCGGGCAT CGGTGAGATC ATTGGTGGCT CCCAGCGTGA AGAGCGTCTG
GACGTGCTGG ACGAGCGTAT GCTGGAAATG GGCCTGAACA AAGAAGATTA CTGGTGGTAT
CGCGATCTGC GTCGCTACGG TACTGTTCCG CATTCCGGTT TCGGTCTTGG TTTTGAACGC
CTGATTGCTT ACGTAACTGG CGTGCAAAAC GTGCGTGATG TGATTCCGTT CCCACGTACT
CCGCGCAACG CCAGCTTCTG A
 
Protein sequence
MSVVPVADVL QGRVAVDSEV TVRGWVRTRR DSKAGISFLA VYDGSCFDPV QAVINNSLPN 
YNEDVLRLTT GCSVIVTGKV VASPGQGQQF EIQASKVEVA GWVEDPDTYP MAAKRHSIEY
LREVAHLRPR TNLIGAVARV RHTLAQALHR FFNEQGFFWV STPLITASDT EGAGEMFRVS
TLDLENLPRN DQGKVDFDKD FFGKESFLTV SGQLNGETYA CALSKIYTFG PTFRAENSNT
SRHLAEFWML EPEVAFANLN DIAGLAEAML KYVFKAVLEE RADDMKFFAE RVDKDAVSRL
ERFIEADFAQ VDYTDAVTIL ENCGRKFENP VYWGVDLSSE HERYLAEEHF KAPVVVKNYP
KDIKAFYMRL NEDGKTVAAM DVLAPGIGEI IGGSQREERL DVLDERMLEM GLNKEDYWWY
RDLRRYGTVP HSGFGLGFER LIAYVTGVQN VRDVIPFPRT PRNASF