Gene EcolC_3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3475 
Symbol 
ID6068277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3790213 
End bp3791172 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content52% 
IMG OID641602891 
Productacetyl-CoA carboxylase carboxyltransferase subunit alpha 
Protein accessionYP_001726416 
Protein GI170021462 
COG category[I] Lipid transport and metabolism 
COG ID[COG0825] Acetyl-CoA carboxylase alpha subunit 
TIGRFAM ID[TIGR00513] acetyl-CoA carboxylase, carboxyl transferase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.292571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.288336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA ATTTCCTTGA TTTTGAACAG CCGATTGCAG AGCTGGAAGC GAAAATCGAT 
TCTCTGACTG CGGTTAGCCG TCAGGATGAG AAACTGGATA TTAACATCGA TGAAGAAGTG
CATCGTCTGC GTGAAAAAAG CGTAGAACTG ACACGTAAAA TCTTCGCCGA TCTCGGTGCA
TGGCAGATTG CGCAACTGGC ACGCCATCCA CAGCGTCCTT ATACCCTGGA TTACGTTCGC
CTGGCATTTG ATGAATTTGA CGAACTGGCT GGCGACCGCG CGTATGCAGA CGATAAAGCT
ATCGTCGGTG GTATCGCCCG TCTCGATGGT CGTCCGGTGA TGATCATTGG TCATCAAAAA
GGTCGTGAAA CCAAAGAAAA AATTCGCCGT AACTTTGGTA TGCCAGCGCC AGAAGGTTAC
CGCAAAGCAC TGCGTCTGAT GCAAATGGCT GAACGCTTTA AGATGCCAAT CATCACCTTT
ATCGACACCC CGGGGGCTTA TCCGGGCGTG GGCGCAGAAG AGCGCGGTCA GTCTGAAGCC
ATTGCACGCA ACCTGCGTGA AATGTCTCGC CTTAGCGTAC CGACTATTTG TACCGTTATC
GGTGAAGGTG GTTCTGGCGG CGCGCTGGCG ATTGGCGTGG GCGATAAAGT GAATATGCTG
CAATACAGCA CCTATTCCGT TATCTCGCCG GAAGGTTGTG CGTCCATTCT GTGGAAGAGC
GCTGATAAAG CGCCGCTGGC GGCTGAAGCG ATGGGTATCA TCGCTCCGCG TCTGAAAGAA
CTAAAACTGA TCGACTCCAT CATCCCAGAA CCGCTGGGTG GTGCTCACCG TAACCCGGAA
GCGATGGCGG CATCGTTGAA AGCGCAACTG TTGGCGGATC TGGCCGATCT CGACGTGTTA
AGCACTGAAG ATTTAAAAAA TCGTCGTTAT CAGCGCCTGA TGAGCTACGG TTACGCGTAA
 
Protein sequence
MSLNFLDFEQ PIAELEAKID SLTAVSRQDE KLDINIDEEV HRLREKSVEL TRKIFADLGA 
WQIAQLARHP QRPYTLDYVR LAFDEFDELA GDRAYADDKA IVGGIARLDG RPVMIIGHQK
GRETKEKIRR NFGMPAPEGY RKALRLMQMA ERFKMPIITF IDTPGAYPGV GAEERGQSEA
IARNLREMSR LSVPTICTVI GEGGSGGALA IGVGDKVNML QYSTYSVISP EGCASILWKS
ADKAPLAAEA MGIIAPRLKE LKLIDSIIPE PLGGAHRNPE AMAASLKAQL LADLADLDVL
STEDLKNRRY QRLMSYGYA