Gene EcSMS35_0732 details


Gene Information

Locus tag: EcSMS35_0732
Symbol: gltA
ID: 6146855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 738010
End bp: 739293
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 51%
IMG OID: 641615622
Product: type II citrate synthase
Protein accession: YP_001742821
Protein GI: 170683629
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01798] citrate synthase I (hexameric type)
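
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 738010
END_BP = 739293
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one for the stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1284
print(expected_protein_length(GENE_LENGTH_BP))  # 427
```

Both results agree with the Gene Length and Protein Length fields of this record.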


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.482725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.945982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATA CAAAAGCAAA ACTCACCCTC AACGGGGACA CAGCTGTTGA ACTGGATGTG 
CTGAAAGGCA CGCTGGGTCA AGATGTTATT GATATCCGTA CTCTCGGTTC AAAAGGTGTG
TTCACCTTTG ACCCAGGCTT CACTTCAACC GCATCCTGCG AATCTAAAAT TACTTTTATT
GATGGTGATG AAGGTATTTT GCTGCACCGC GGTTTCCCGA TCGATCAGCT GGCGACCGAT
TCTAACTACC TGGAAGTTTG TTACATCCTG CTGAATGGTG AAAAACCAAC TCAGGAACAG
TATGACGAAT TTAAAACTAC GGTGACCCGT CATACCATGA TCCACGAGCA GATTACCCGT
CTGTTCCATG CTTTCCGTCG CGACTCGCAT CCAATGGCAG TCATGTGTGG TATTACCGGC
GCGCTGGCGG CGTTCTATCA CGACTCGCTG GATGTTAACA ATCCTCGTCA TCGTGAAATT
GCCGCGTTCC GCCTGCTGTC GAAAATGCCG ACCATGGCCG CGATGTGTTA CAAGTATTCC
ATTGGTCAGC CATTCGTTTA CCCGCGCAAC GATCTCTCCT ATGCCGGTAA CTTCCTGAAT
ATGATGTTCT CCACGCCGTG CGAACCGTAT GAAGTTAATC CGATTCTGGA ACGTGCTATG
GACCGTATTC TGATCCTGCA CGCTGACCAT GAACAGAACG CCTCTACCTC CACCGTGCGT
ACCGCTGGCT CTTCGGGGGC GAACCCGTTT GCCTGTATCG CAGCAGGTAT TGCTTCACTG
TGGGGACCTG CGCACGGTGG TGCTAACGAA GCGGCGCTGA AAATGCTGGA AGAAATCAGC
TCCGTTAAAC ACATTCCGGA ATTTGTTCGT CGTGCGAAAG ACAAAAATGA CTCTTTCCGC
CTGATGGGCT TCGGGCACCG CGTGTACAAA AATTACGACC CGCGCGCCAC CGTAATGCGT
GAAACCTGCC ATGAAGTGCT GAAAGAGCTG GGTACCAAAG ATGACCTGCT GGAAGTGGCT
ATGGAGCTGG AAAACATCGC GCTGAACGAC CCGTACTTTA TCGAGAAGAA ACTCTACCCG
AACGTCGATT TCTACTCTGG TATCATCCTG AAAGCGATGG GTATTCCGTC TTCTATGTTC
ACCGTGATCT TCGCGATGGC GCGTACCGTT GGCTGGATCG CCCACTGGAG CGAAATGCAC
AGTGACGGTA TGAAGATTGC CCGTCCGCGT CAGCTGTATA CAGGATATGA AAAACGCGAC
TTTAAAAGCG ATATCAAGCG TTAA
 
Protein sequence
MADTKAKLTL NGDTAVELDV LKGTLGQDVI DIRTLGSKGV FTFDPGFTST ASCESKITFI 
DGDEGILLHR GFPIDQLATD SNYLEVCYIL LNGEKPTQEQ YDEFKTTVTR HTMIHEQITR
LFHAFRRDSH PMAVMCGITG ALAAFYHDSL DVNNPRHREI AAFRLLSKMP TMAAMCYKYS
IGQPFVYPRN DLSYAGNFLN MMFSTPCEPY EVNPILERAM DRILILHADH EQNASTSTVR
TAGSSGANPF ACIAAGIASL WGPAHGGANE AALKMLEEIS SVKHIPEFVR RAKDKNDSFR
LMGFGHRVYK NYDPRATVMR ETCHEVLKEL GTKDDLLEVA MELENIALND PYFIEKKLYP
NVDFYSGIIL KAMGIPSSMF TVIFAMARTV GWIAHWSEMH SDGMKIARPR QLYTGYEKRD
FKSDIKR
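
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates a nucleotide string. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGGCTGATA CAAAAGCAAA ACTCACCCTC AACGGGGACA CAGCTGTTGA ACTGGATGTG"
last_row = "TTTAAAAGCG ATATCAAGCG TTAA"
print(translate(first_row))  # MADTKAKLTLNGDTAVELDV
print(translate(last_row))   # FKSDIKR
```

The two translated rows match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.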