Gene EcSMS35_4422 details

Gene Information

Locus tag: EcSMS35_4422
Symbol: coaA
ID: 6146628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4519648
End bp: 4520598
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 44%
IMG OID: 641619242
Product: pantothenate kinase
Protein accession: YP_001746362
Protein GI: 170680730
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1072] Panthothenate kinase
TIGRFAM ID: [TIGR00554] pantothenate kinase, bacterial type
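
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4519648, 4520598)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```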


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000298887
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000844896
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTATAA AAGAGCAAAC GTTAATGACG CCTTACCTAC AGTTTGACCG CAACCAGTGG 
GCAGCTCTGC GTGATTCCGT ACCTATGACG TTATCGGAAG ATGAGATCGC CCGTCTCAAA
GGTATTAATG AAGATCTCTC GTTAGAAGAA GTTGCCGAGA TCTATTTACC TCTGTCACGT
TTGCTGAACT TCTATATAAG CTCGAATCTG CGCCGTCAGG CAGTTCTGGA ACAGTTTCTT
GGTACCAACG GGCAACGCAT TCCTTACATT ATCAGTATTG CTGGTAGTGT CGCGGTGGGG
AAAAGTACAA CCGCCCGTGT ATTGCAGGCG CTATTAAGCC GTTGGCCGGA ACATCGTCGT
GTTGAACTGA TCACGACTGA TGGCTTCCTT CACCCTAATC AGGTACTGAA AGAACGTGGT
CTGATGAAGA AGAAAGGCTT CCCGGAATCG TATGATATGC ATCGCCTGGT GAAGTTTGTT
TCCGATCTCA AATCCGGCGT GCCAAACGTT ACAGCACCTG TTTACTCGCA TCTTATTTAT
GATGTGATCC CGGATGGAGA TAAAACGGTT GTTCAGCCTG ATATTTTAAT TCTTGAAGGG
TTAAATGTCT TACAGAGCGG AATGGATTAT CCACACGATC CACATCATGT ATTTGTTTCT
GATTTTGTCG ATTTTTCGAT ATATGTTGAT GCACCGGAAG ACTTACTTCA GACGTGGTAT
ATCAACCGTT TTCTGAAATT CCGGGAAGGG GCTTTTACAG ACCCGGATTC CTATTTTCAT
AACTACGCGA AATTAACTAA AGAAGAAGCG ATTAATACTG CCATGACGTT GTGGAAAGAG
ATCAACTGGC TGAACTTAAA GCAAAATATT CTACCTACTC GCGAGCGCGC CAGTTTAATC
CTGACGAAAA GTGCTAATCA TGCGGTAGAA GAGGTCAGAC TACGCAAATA A
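
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be checked against the values reported for this record. A minimal sketch of that cleanup follows; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, exactly as printed on this page.
first_row = "ATGAGTATAA AAGAGCAAAC GTTAATGACG CCTTACCTAC AGTTTGACCG CAACCAGTGG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence text instead of the first row gives the values to compare with the reported gene length and GC content.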
 
Protein sequence
MSIKEQTLMT PYLQFDRNQW AALRDSVPMT LSEDEIARLK GINEDLSLEE VAEIYLPLSR 
LLNFYISSNL RRQAVLEQFL GTNGQRIPYI ISIAGSVAVG KSTTARVLQA LLSRWPEHRR
VELITTDGFL HPNQVLKERG LMKKKGFPES YDMHRLVKFV SDLKSGVPNV TAPVYSHLIY
DVIPDGDKTV VQPDILILEG LNVLQSGMDY PHDPHHVFVS DFVDFSIYVD APEDLLQTWY
INRFLKFREG AFTDPDSYFH NYAKLTKEEA INTAMTLWKE INWLNLKQNI LPTRERASLI
LTKSANHAVE EVRLRK
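
The protein sequence can be checked against the gene sequence by translating the cleaned DNA codon by codon. The sketch below is one way to do that, not the method used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence as printed on this page.
first_row = "ATGAGTATAA AAGAGCAAAC GTTAATGACG CCTTACCTAC AGTTTGACCG CAACCAGTGG"
last_row = "CTGACGAAAA GTGCTAATCA TGCGGTAGAA GAGGTCAGAC TACGCAAATA A"
print(translate(first_row))  # MSIKEQTLMTPYLQFDRNQW
print(translate(last_row))   # LTKSANHAVEEVRLRK
```

The last row can be translated on its own because the 15 rows before it hold 60 bases each, so it starts on a codon boundary.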