Gene EcolC_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3670 
SymbollplA 
ID6065956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4019779 
End bp4020795 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID641603085 
Productlipoate-protein ligase A 
Protein accessionYP_001726608 
Protein GI170021654 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0095] Lipoate-protein ligase A 
TIGRFAM ID[TIGR00545] lipoyltransferase and lipoate-protein ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.632003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACAT TACGCCTGCT CATCTCTGAC TCTTACGACC CGTGGTTTAA CCTGGCGGTG 
GAGGAGTGTA TTTTTCGCCA GATGCCCGCC ACGCAGCGCG TTCTGTTTCT CTGGCGCAAT
GCCGACACGG TAGTGATTGG TCGCGCGCAG AACCCGTGGA AAGAGTGTAA TACCCGGCGG
ATGGAAGAAG ATAACGTCCG CCTGGCACGG CGCAGTAGCG GTGGCGGCGC GGTGTTCCAC
GATCTCGGCA ATACCTGCTT TACCTTTATG GCTGGCAAGC CGGAGTACGA TAAAACCATC
TCCACGTCGA TTGTGCTCAA TGCGCTGAAC GCGCTTGGCG TCAGCGCCGA AGCGTCCGGG
CGTAACGATC TGGTGGTGAA AACCGCCGAA GGCGACCGCA AAGTCTCAGG ATCGGCCTAT
CGCGAAACCA AAGATCGTGG CTTCCACCAC GGCACCTTGC TGCTCAATGC CGACCTTAGC
CGCCTGGCAA ACTATCTCAA TCCGGATAAA AAGAAACTGG CGGCGAAAGG CATTACCTCA
GTGCGTTCCC GCGTGACCAA CCTCACCGAG CTGCTGCCGG GGATCACCCA TGAGCAGGTT
TGCGAGGCCA TAACCAAGGC CTTTTTCGCC CATTATGGCG AGCGTGTAGA AGCGGAAATC
ATCTCCCCGG ACAAAACGCC AGACTTGCCA AACTTCGCCG AAACCTTTGC CCGTCAGAGT
AGCTGGGAAT GGAACTTCGG TCAGGCTCCG GCATTCTCGC ATCTGCTGGA TGAACGCTTT
AGCTGGGGCG GCGTGGAACT GCATTTCGAC GTTGAAAAAG GCCATATCAC CCGCGCCCAG
GTGTTTACCG ACAGCCTCAA CCCCGCGCCG CTGGAAGCCC TCGCCGGGCG ACTGCAAGGC
TGCCTGTACC GCGCGGATAT GCTGCAACAA GAGTGCGAAG CGCTGTTGGT TGACTTCCCG
GACCAGGAAA AAGAGCTACG GAAGTTGTCG ACGTGGATAG CGGGGGCGGT AAGGTAA
 
Protein sequence
MSTLRLLISD SYDPWFNLAV EECIFRQMPA TQRVLFLWRN ADTVVIGRAQ NPWKECNTRR 
MEEDNVRLAR RSSGGGAVFH DLGNTCFTFM AGKPEYDKTI STSIVLNALN ALGVSAEASG
RNDLVVKTAE GDRKVSGSAY RETKDRGFHH GTLLLNADLS RLANYLNPDK KKLAAKGITS
VRSRVTNLTE LLPGITHEQV CEAITKAFFA HYGERVEAEI ISPDKTPDLP NFAETFARQS
SWEWNFGQAP AFSHLLDERF SWGGVELHFD VEKGHITRAQ VFTDSLNPAP LEALAGRLQG
CLYRADMLQQ ECEALLVDFP DQEKELRKLS TWIAGAVR