Gene EcSMS35_2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2342 
Symbol 
ID6143936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2375643 
End bp2376695 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content54% 
IMG OID641617215 
Productcytochrome c-type biogenesis family protein 
Protein accessionYP_001744387 
Protein GI170680448 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3088] Uncharacterized protein involved in biosynthesis of c-type cytochromes
[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.397636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTTT TATTGGGCGT GCTGATGCTG ATGATCTCCG GCTCAGCGCT GGCGACCATC 
GACGTGTTGC AGTTTAAGGA TGAAGCGCAG GAGCAGCAGT TCCGCCAACT CACTGAAGAA
CTACGCTGCC CGAAATGCCA GAACAACAGC ATTGCCGATT CCAACTCGAT GATTGCCACC
GACCTGCGCC AGAAAGTGTA TGAACTGATG CAGGAAGGTA AAAGTAAGAA AGAGATTGTC
GATTATATGG TGGCGCGTTA CGGCAACTTC GTCACTTACG ATCCGCCGTT AACGCCGCTG
ACCGTGCTGC TGTGGGTGCT TCCGGTAGTG GCTATTGGCA TTGGCGGTTG GGTCATTTAC
GCCCGTTCGC GGCGTCGGGT ACGCGTGGTG CCGGACGCGT TTCCTGAACA AAGCGTGCCG
GAAGGTAAGC GTGCCGGATA TATTGTTTAT CTGCCGGGTA TTGTGGTGGC GTTAATTGTG
GCTGGCGTCA GCTACTACCA GACGGGCAAT TATCAGCAGG TGAAAATCTG GCAGCAGGCC
ACGGCACAGG CTCCGGCGTT ACTGGACAGG GCGCTGGATC CGAAAGCCGA TCCGCTCAAC
GAAGAAGAGA TGTCGCGCCT GGCGCTGGGG ATGCGTACTC AACTGCAAAA AAATCCGGGA
GATATAGAAG GCTGGATTAT GTTGGGCCGC GTTGGCATGG CGCTGGGTAA CGCCAGTATT
GCCACCGATG CATACGCTAC TGCATATCGC CTCGATCCGA AGAACAGTGA TGCGGCGCTC
GGATACGCTG AAGCGTTGAC ACGTTCATCT GATCCCAACG ACAACCGCCT GGGTGGTGAA
CTGCTACGCC AGTTGGTGAG AACTGACCAC AGCAATATCC GTGTGTTAAG CATGTATGCG
TTTAATGCCT TTGAGCAGCA GCGATTTGGC GAAGCCGTTG CCGCGTGGGA GATGATGTTG
AAACTCTTAC CTGCCAATGA TACTCGCCGT GCGGTGATTG AACGTAGTAT CGCGCAGGCG
ATGCAACATT TGTCGCCGCA GGAGAGTAAA TAA
 
Protein sequence
MRFLLGVLML MISGSALATI DVLQFKDEAQ EQQFRQLTEE LRCPKCQNNS IADSNSMIAT 
DLRQKVYELM QEGKSKKEIV DYMVARYGNF VTYDPPLTPL TVLLWVLPVV AIGIGGWVIY
ARSRRRVRVV PDAFPEQSVP EGKRAGYIVY LPGIVVALIV AGVSYYQTGN YQQVKIWQQA
TAQAPALLDR ALDPKADPLN EEEMSRLALG MRTQLQKNPG DIEGWIMLGR VGMALGNASI
ATDAYATAYR LDPKNSDAAL GYAEALTRSS DPNDNRLGGE LLRQLVRTDH SNIRVLSMYA
FNAFEQQRFG EAVAAWEMML KLLPANDTRR AVIERSIAQA MQHLSPQESK