Gene EcE24377A_2798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2798 
SymbolhisS 
ID5590346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2789699 
End bp2790973 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content55% 
IMG OID640926449 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001463836 
Protein GI157155202 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0241493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAAAA ACATTCAAGC CATTCGCGGC ATGAACGATT ACCTGCCTGG CGAAACGGCC 
ATCTGGCAGC GCATTGAAGG CACACTGAAA AACGTGCTCG GCAGCTACGG TTACAGTGAA
ATCCGCTTGC CGATTGTAGA GCAGACCCCG CTATTCAAAC GTGCGATTGG TGAAGTCACC
GACGTGGTTG AAAAAGAGAT GTACACCTTT GAGGATCGCA ATGGCGACAG CCTGACTCTG
CGCCCTGAAG GGACGGCGGG CTGTGTACGC GCCGGCATCG AGCATGGTCT TCTGTACAAT
CAGGAACAGC GTCTGTGGTA TATCGGGCCG ATGTTCCGTC ACGAGCGTCC GCAGAAAGGG
CGTTATCGTC AGTTCCATCA GTTGGGCTGC GAAGTTTTCG GTCTGCAAGG TCCGGATATC
GACGCTGAAC TGATTATGCT CACCGCCCGC TGGTGGCGTG CGCTGGGTAT CTCCGAACAC
GTAACTCTTG AGCTGAATTC TATCGGTTCG CTGGAAGCAC GCGCCAATTA CCGCGATGCG
CTGGTGGCAT TCCTTGAGCA GCATAAAGAA AAGCTGGACG AAGACTGCAA ACGCCGCATG
TACACTAACC CGCTGCGCGT GCTGGATTCC AAAAATCCGG AAGTGCAGGC GCTTCTCAAC
GATGCTCCGG CATTAGGCGA TTATCTGGAC GAGGAGTCTC GTGAGCACTT TGCCGGTCTG
TGCAAACTGC TTGAGAGCGC GGGGATCGCT TACACCGTAA ACCAGCGTCT GGTGCGTGGT
CTGGATTACT ATAACCGTAC CGTTTTCGAG TGGGTGACTA ACAGTCTCGG CTCCCAGGGC
ACCGTGTGTG CAGGCGGTCG TTATGACGGT CTTGTGGAAC AACTGGGCGG TCGTGCAACA
CCGGCTGTCG GTTTTGCGAT GGGCCTCGAA CGTCTTGTAT TGTTAGTACA GGCCGTTAAT
CCGGAATTTA AAGCCGATCC TGTTGTCGAT ATATACCTGG TGGCTTCAGG TGCTGATACA
CAATCTGCGG CTATGGCATT AGCTGAGCGT CTGCGTGATG AATTACCGGG CGTGAAATTG
ATGACCAACC ACGGCGGCGG CAACTTTAAG AAACAGTTTG CCCGTGCTGA TAAATGGGGT
GCCCGCGTTG CTGTGGTGCT GGGTGAGTCT GAAGTGGCTA ACGGCACAGC AGTAGTGAAG
GATTTGCGCT CTGGTGAGCA AACGGCAGTT GCGCAGGATA GCGTAGCCGC GCATTTGCGC
ACGTTACTGG GTTAA
 
Protein sequence
MAKNIQAIRG MNDYLPGETA IWQRIEGTLK NVLGSYGYSE IRLPIVEQTP LFKRAIGEVT 
DVVEKEMYTF EDRNGDSLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYIGP MFRHERPQKG
RYRQFHQLGC EVFGLQGPDI DAELIMLTAR WWRALGISEH VTLELNSIGS LEARANYRDA
LVAFLEQHKE KLDEDCKRRM YTNPLRVLDS KNPEVQALLN DAPALGDYLD EESREHFAGL
CKLLESAGIA YTVNQRLVRG LDYYNRTVFE WVTNSLGSQG TVCAGGRYDG LVEQLGGRAT
PAVGFAMGLE RLVLLVQAVN PEFKADPVVD IYLVASGADT QSAAMALAER LRDELPGVKL
MTNHGGGNFK KQFARADKWG ARVAVVLGES EVANGTAVVK DLRSGEQTAV AQDSVAAHLR
TLLG