Gene EcSMS35_0812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0812 
Symbol 
ID6146487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp814194 
End bp815435 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content55% 
IMG OID641615700 
Productcardiolipin synthase 2 
Protein accessionYP_001742892 
Protein GI170682929 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0377423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.805381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGTA GCTGGCGCGA AGGCAATAAG ATCCAGTTGC TGGAAAACGG CGAGCAATAT 
TATCCCGCGG TGTTTAAGGC GATTGGCGAG GCGCAAGAAC GCATCATTCT TGAAACGTTT
ATCTGGTTTG AGGATGACGT CGGCAAACAG CTGCATGCGG CGCTACTGGC AGCAGCGCAA
CGCGGGGTGA AAGCGGAAGT CTTGCTGGAT GGCTACGGTT CGCCGGATCT CAGCGATGAG
TTTGTCAATG AACTGACGGC AGCTGGCGTG GTATTTCGCT ACTACGATCC CCGCCCGCGC
CTTTTTGGTA TGCGCACCAA TGTGTTTCGC CGGATGCATC GCAAAATTGT GGTGATCGAC
GCGCTTATCG CCTTTATTGG CGGGCTGAAT TACTCCGCCG AGCATATGTC CAGCTACGGT
CCAGAGGCCA AACAGGATTA CGCGGTACGC CTTGAAGGGC CGATTGTTGA AGATATCCTC
CAGTTTGAAC AGGAAAATCT GCCAGGACAG AGCGCGGCCC GACGCTGGTG GCGACGTCAT
CACAAAGCGG AAGAAAATCG CCAGCCGGGA GAAGCGCAGG TATTGCTGGT CTGGCGCGAT
AACGAAGAAC ATCGCGATGA TATCGAACGT CACTATCTGA AAATGCTCAC TCAGGCGCGG
CGAGAAGTGA TTATCGCCAA CGCCTACTTC TTCCCCGGCT ATCGATTTTT ACACGCCTTG
CGTAAAGCGG CACGGCGCGG GGTGCGGATC AAACTGATCA TTCAGGGCGA ACCGGATATG
CCGATTGTCA GAGTCGGTGC GCGTTTGCTG TATAACTATC TGGTTAAAGG CGGCGTTCAG
GTGTTTGAGT ACCGCCGCCG TCCGCTACAT GGCAAAGTGG CATTGATGGA CGATCACTGG
GCGACGGTAG GATCCAGTAA TCTCGATCCG CTCAGTTTGT CACTGAATCT CGAAGCAAAT
GTCATCATCC ACGATCGTCA TTTTAACCAG ACGCTGCGTG ATAATCTGAA CGGCATTATC
GCCGCAGATT GTCAGCAGGT GGATGAGACC ATGCTGCCGA AACGCACCTG GTGGAATCTG
ACCAAAAGCG TGCTGGCCTT CCACTTTTTA CGCCACTTCC CGGCGCTGGT CGGCTGGCTT
CCGGCACACA CGCCACGTCT GGCGCAGGTT GATCCCCCCG CACAACCGAC AATGGAAACG
CAGGATCGGG TAGAAACTGA AAACACGGGG GTAAAACCCT GA
 
Protein sequence
MKCSWREGNK IQLLENGEQY YPAVFKAIGE AQERIILETF IWFEDDVGKQ LHAALLAAAQ 
RGVKAEVLLD GYGSPDLSDE FVNELTAAGV VFRYYDPRPR LFGMRTNVFR RMHRKIVVID
ALIAFIGGLN YSAEHMSSYG PEAKQDYAVR LEGPIVEDIL QFEQENLPGQ SAARRWWRRH
HKAEENRQPG EAQVLLVWRD NEEHRDDIER HYLKMLTQAR REVIIANAYF FPGYRFLHAL
RKAARRGVRI KLIIQGEPDM PIVRVGARLL YNYLVKGGVQ VFEYRRRPLH GKVALMDDHW
ATVGSSNLDP LSLSLNLEAN VIIHDRHFNQ TLRDNLNGII AADCQQVDET MLPKRTWWNL
TKSVLAFHFL RHFPALVGWL PAHTPRLAQV DPPAQPTMET QDRVETENTG VKP