Gene EcolC_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2666 
SymbolasnC 
ID6066122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2921374 
End bp2922774 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content53% 
IMG OID641602072 
Productasparaginyl-tRNA synthetase 
Protein accessionYP_001725622 
Protein GI170020668 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0017] Aspartyl/asparaginyl-tRNA synthetases 
TIGRFAM ID[TIGR00457] asparaginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000363068 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0518169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTG TGCCTGTAGC CGACGTACTC CAGGGCCGCG TAGCCGTTGA CAGCGAAGTC 
ACCGTACGCG GATGGGTACG TACCCGCCGA GATTCAAAAG CTGGCATCTC CTTCCTCGCC
GTTTATGACG GTTCCTGCTT TGATCCTGTA CAGGCTGTCA TCAATAATTC TCTGCCCAAT
TACAATGAAG ACGTCCTGCG TCTGACCACC GGTTGCTCGG TCATTGTGAC GGGTAAAGTC
GTGGCGTCGC CGGGCCAGGG GCAACAATTT GAAATTCAGG CCAGCAAGGT TGAAGTTGCT
GGTTGGGTTG AAGATCCAGA CACTTACCCG ATGGCGGCAA AACGCCACAG CATTGAGTAT
CTGCGTGAAG TCGCTCACCT GCGTCCGCGC ACAAACCTGA TTGGTGCCGT CGCGCGCGTT
CGCCATACGC TGGCGCAGGC GCTGCATCGC TTCTTTAACG AGCAGGGATT CTTCTGGGTT
TCAACGCCAC TGATTACTGC GTCTGATACC GAAGGGGCTG GCGAAATGTT CCGCGTTTCT
ACGCTGGATC TGGAAAATCT GCCGCGTAAC GATCAGGGCA AAGTGGATTT CGACAAAGAC
TTCTTTGGTA AAGAGTCTTT CCTGACCGTA TCTGGCCAGT TGAACGGCGA AACCTACGCT
TGCGCATTAT CCAAAATTTA TACCTTCGGC CCGACTTTCC GTGCTGAAAA CTCCAACACC
AGCCGTCACC TGGCGGAGTT CTGGATGCTG GAGCCGGAAG TGGCGTTTGC TAACCTGAAC
GATATCGCGG GTCTGGCTGA AGCCATGCTG AAATATGTCT TCAAAGCGGT TCTCGAAGAA
CGCGCAGACG ACATGAAATT CTTCGCTGAA CGCGTAGATA AAGATGCCGT TTCACGTCTG
GAACGCTTCA TTGAAGCCGA TTTTGCGCAG GTGGATTACA CCGACGCAGT GACCATTCTC
GAAAACTGCG GCAGGAAGTT TGAAAACCCA GTTTACTGGG GCGTTGATCT CTCTTCTGAG
CATGAGCGTT ATCTGGCGGA AGAACACTTT AAAGCACCGG TAGTGGTTAA AAACTATCCG
AAAGATATTA AAGCGTTCTA TATGCGCCTT AACGAAGACG GTAAAACCGT TGCGGCTATG
GACGTTCTGG CTCCGGGCAT CGGTGAGATC ATTGGTGGCT CCCAGCGTGA AGAGCGTCTG
GACGTGCTGG ACGAGCGTAT GCTGGAAATG GGCCTGAACA AAGAAGATTA CTGGTGGTAT
CGCGATCTGC GTCGCTACGG TACTGTTCCG CATTCCGGTT TCGGTCTTGG TTTTGAACGC
CTGATTGCTT ACGTAACTGG CGTGCAAAAC GTGCGTGATG TGATTCCGTT CCCACGTACT
CCGCGCAACG CCAGCTTCTG A
 
Protein sequence
MSVVPVADVL QGRVAVDSEV TVRGWVRTRR DSKAGISFLA VYDGSCFDPV QAVINNSLPN 
YNEDVLRLTT GCSVIVTGKV VASPGQGQQF EIQASKVEVA GWVEDPDTYP MAAKRHSIEY
LREVAHLRPR TNLIGAVARV RHTLAQALHR FFNEQGFFWV STPLITASDT EGAGEMFRVS
TLDLENLPRN DQGKVDFDKD FFGKESFLTV SGQLNGETYA CALSKIYTFG PTFRAENSNT
SRHLAEFWML EPEVAFANLN DIAGLAEAML KYVFKAVLEE RADDMKFFAE RVDKDAVSRL
ERFIEADFAQ VDYTDAVTIL ENCGRKFENP VYWGVDLSSE HERYLAEEHF KAPVVVKNYP
KDIKAFYMRL NEDGKTVAAM DVLAPGIGEI IGGSQREERL DVLDERMLEM GLNKEDYWWY
RDLRRYGTVP HSGFGLGFER LIAYVTGVQN VRDVIPFPRT PRNASF